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Introduction 

We  have  undertaken  two  separate  projects  to  gain  further  insights  into  the  means  by 
which  cell  proliferation  is  controlled  by  cyclin  dependent  kinases.  First,  a  screen  has  been 
carried  out  for  proteins  that  interact  with  the  budding  yeast  cdk,  Cdc28.  Ten  novel 
interacting  proteins  have  been  identified  in  a  two-hybrid  screen  using  a  Gal4/Cdc28 
fusion  as  the  bait.  For  each  of  the  positives  the  nature  of  the  Cdc28  interaction  is  being 
determined  (i.e.  whether  substrate  or  part  of  a  complex).  Second,  using  recently 
developed  DNA  microarray  technology  we  have  characterised  yeast  cyclin  specificity  in 
terms  of  their  ability  to  induce  and  repress  transcription  of  other  yeast  genes.  In  addition, 
in  collaboration  with  Dr.  Botstein’s  group  at  Stanford  University,  we  have  sought  to 
define  the  complete  set  of  cell  cycle-regulated  genes  in  yeast.  The  microarray  project  has 
now  been  completed  and  a  paper  recently  published  (1998  MBC  9  3273-3297). 
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Body 

Specificity  of  Cdc28-interacting  two-hvbrid  positives 

A  two-hybrid  screen  was  carried  out  using  a  Gal4/Cdc228  fusion  protein  as  bait.  In 
collaboration  with  Stan  Field’s  laboratory,  the  Gal4/Cdc28  fusion  was  tested  for  two 
hybrid  interactions  against  a  library  containing  all  budding  yeast  ORFs  fused  to  the  Gal4 
activation  domain.  From  this  screen  15  different  positives  were  isolated,  which  included 
five  of  the  known  yeast  cyclins  (Clnl,  Cln2,  Cln3,  Clbl  and  Pcl7).  The  other  10  positives 
were  all  novel  interactions  and  I  decided  to  concentrate  on  these  for  further 
characterisation. 

Of  the  many  cyclins  in  yeast  the  function  of  the  G1  cyclins,  Clnl-3,  the  S-phase  cyclins 
Clb5  and  6,  and  the  mitotic  cyclins  Clbl -4  are  best  understood.  For  each  of  the  ten  non- 
cyclin  positives  it  would  be  very  useful  to  know  which  of  the  cyclin-cdc28  kinase 
complexes  they  interacted  with.  Therefore  it  was  decided  to  test  the  10  positives  for  two- 
hybrid  interactions  using  different  cyclins  as  baits.  To  do  this  Clb2,  Clb5  and  Cln3 
fusions  were  constructed  and  the  plasmids  transformed  into  a  reporter  strain  harbouring 
the  Cdc28  interacting  positives.  Transformed  cells  were  plated  out  on  leu-trp-  SC  plates 
and  single  colonies  obtained.  Patched  colonies  on  fresh  plates  were  then  replica-plated  to 
his-,  his-  +3mM  3AT,  his-  -t-lOmM  3AT  plates  and  incubated  at  30C  for  several  days. 
From  this  the  following  results  were  obtained: 


(+  Indicates  growth  on  his-  plates) 

Cvclin  bait  used  in  two-hvbrid  assay 


GENE  CLN3 

STBl 

SPC42 

ARP8 

YKR077W 

YOR066W 

YDR130C 

YPL0142W  -l- 

YPL070W 

YKL014C 

CAKl 


CLB5 

+ 


+ 

+ 


CLB2 

+ 

+ 

+ 


Significantly,  all  of  the  positive  interactions  that  were  noted  in  the  table  above  were  also 
observed  on  his-  plates  containing  either  3  or  lOmM  3AT. 
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Our  finding  that  six  of  the  positives  showed  no  significant  interactions  with  the  cyclin 
baits  used  suggets  that  they  may  be  interacting  with  Cdc28  via  other  cyclin  complexes.  I 
have  therefore  undertaken  to  construct  Cln2  and  Clb3  two-hybrid  baits  to  use  in  further 
tests  of  the  cyclin  specificity  of  the  10  Cdc28  positives. 


Deletion  analysis  of  Avkr077w  and  Avor066w  mutant  strains 

Probably  the  most  exciting  result  from  the  Cdc28  two-hybrid  screen  was  the 
isolation  of  two  positives  that  are  homologs,  YKR077W  and  YOR066W.  From  our 
analysis  of  all  cell  cycle-regulated  genes  in  yeast  we  have  established  that  both  genes  are 
cell  cycle  regulated  at  the  Gl/S  boundary.  Furthermore  YOR066W  encodes  a  protein  that 
has  several  potential  Cdc28  phosphorylation  sites,  making  it  a  good  candidate  for  a  Cdk 
substrate.  I  decided  to  carry  out  a  deletion  analysis  of  both  genes  to  see  whether  they 
carried  out  an  essential  function.  Strains  were  constructed  in  which  YKR077W  was 
deleted  and  replaced  with  the  URA3  gene,  and  YOR066W  was  deleted  and  replaced  with  a 
Kanamycin  resistance  gene.  Both  Aykr077w:URA3  and  Ayor066w:KANMX  haploid 
strains  were  found  to  be  viable.  Furthermore  both  single  deletion  strains  show  little 
difference  in  growth  rates  compared  to  that  of  an  isogenic  wild  type  control  strain.  I  next 
crossed  the  Aykr077w:URA3  strain  to  the  Ayor066w:KANMX  strain.  The  resulting  diploid 
was  sporulated  and  several  tetrads  dissected.  Surprisingly,  spores  deleted  for  both 
YKR077W  and  YOR066W  were  readily  obtained,  demonstrating  that  the  two  genes, 
though  homologs,  do  not  carry  out  a  common  essential  function.  Furthermore,  similar  to 
the  single  deletions,  I  have  been  unable  to  detect  any  significant  growth  or  morphological 
defects  in  Aykr077w  and  Ayor066w  cells  compared  to  an  isogenic  wildtype  control 
strain. 


Nature  of  the  interaction  between  Ykro77w  and  Cdc28 

Given  that  we  have  identified  10  novel  proteins  that  interact  with  Cdc28  the  most 
important  thing  to  do  next  is  to  confirm  the  interactions  biochemically.  The  two  likeliest 
possibilities  are  that  the  proteins  are  substrates  of  the  Cdc28  kinase,  or  that  they  are 
constituents  of  cyclin— Cdc28  protein  complexes.  With  this  aim  in  mind  I  have  undertaken 
to  insert  an  epitope  tag  at  the  N  or  C  terminus  of  each  of  the  ten  candidate  proteins.  This 
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will  allow  for  detection  of  the  proteins  by  Western  blotting,  and  enable  simple 
biochemical  experiments  to  be  carried  out. 

To  date  I  have  managed  to  test  whether  the  protein  encoded  by  the  YKR077W 
interacts  with  Cdc28.  A  plasmid  was  obtained  containing  YKR077W  under  the  control  of 
the  GAL  promoter.  In  addition  the  gene  carried  an  epitope  tag  at  the  C-terminus.  By 
Western  blotting  a  band  was  detected  that  migrated  at  a  size  consistent  with  the  predicted 
molecular  weight  of  the  Ykr077w  protein.  However  the  protein  migrates  as  a  single  band 
rather  than  a  doublet.  If  Ykr077w  were  a  kinase  substrate  it  might  be  expected  that  the 
phosphorylated  and  unphosphorylated  forms  of  the  protein  would  migrate  on  a 
polyacrlyamide  gel  at  different  rates,  i.e.  a  doublet  would  be  observed.  The  fact  that  this 
is  not  observed  does  not  rule  out  the  possibility  that  Ykr077w  is  a  cdk  substrate,  but  it 
does  make  it  more  difficult  to  establish. 

Next  I  endeavoured  to  Co-immunoprecipitate  the  Ykro77w  and  Cdc28  proteins. 
To  detect  Cdc28  I  used  either  polyclonal  antibody  raised  against  the  Cdc28  protein  or 
PSTAIR  antibody,  which  recognises  a  conserved  motif  in  the  protein  structure  of  all 
cyclin-dependent  kinases.  However,  despite  numerous  attempts  using  different  amounts 
of  protein  extracts,  and  varying  other  parameters,  I  have  yet  to  detect  an  interaction 
between  the  two  proteins. 


Appendix 

Key  research  accomplishments 
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-Comprehensive  identification  of  cell  cycle-regulated  genes  in  yeast 
-Identification  of  10  novel  cdk  interacting  proteins 

-Establishing  that  several  of  the  novel  cdk  interacting  proteins  interact  with  specific 
cyclin-cdc28  complexes. 


Reportable  outcomes 

-Paper  published  on  the  comprehensive  analysis  of  cell  cycle-regulated  genes  in  yeast 
(1998  MBC  9  3273-3297) 


-Public  website  established  providing  a  full  description  and  complete  data  sets  of  all  cell 
cycle-regulated  genes  in  yeast; 

http://cellcycle-www.stanford.edu 


Molecular  Biology  of  the  Cell 
Vol.  9,  3273-3297,  December  1998 

Comprehensive  Identification  of  Cell  Cycle-regulated 
Genes  of  the  Yeast  Saccharomyces  cerevisiae  by 
Microarray  Hybridization^^ 

Paul  T.  Spellman,*^  Gavin  Sherlock,*^^*  Michael  Q.  Zhang,*  Vishwanath  R. 
Iyer,§  Kirk  Anders,*  Michael  B.  Eisen,*  Patrick  O.  Brown,§ll  David 
Botstein,*^  and  Bruce  Futcher* 

^Department  of  Genetics,  Stanford  University  Medical  Center,  Stanford,  California  94306-5120;  tCold 
Spring  Harbor  Laboratory,  Cold  Spring  Harbor,  New  York  11724-2209;  ^Department  of  Biochemistry, 
Stanford  University  Medical  Center,  Stanford,  California  94306-5428;  and  'Howard  Hughes  Medical 
Institute,  Stanford,  California  94305-5428 

Submitted  September  4, 1998;  Accepted  October  15,  1998 
Monitoring  Editor:  Gerald  R.  Fink 


We  sought  to  create  a  comprehensive  catalog  of  yeast  genes  whose  transcript  levels  vary 
periodically  within  the  cell  cycle.  To  this  end,  we  used  DNA  microarrays  and  samples 
from  yeast  cultures  synchronized  by  three  independent  methods:  a  factor  arrest,  elutria- 
tion,  and  arrest  of  a  cdcl5  temperature-sensitive  mutant.  Using  periodicity  and  correla¬ 
tion  algorithms,  we  identified  800  genes  that  meet  an  objective  minimum  criterion  for  cell 
cycle  regulation.  In  separate  experiments,  designed  to  examine  the  effects  of  inducing 
either  the  G1  cyclin  ClnSp  or  the  B-type  cyclin  Clb2p,  we  found  that  the  mRNA  levels  of 
more  than  half  of  these  800  genes  respond  to  one  or  both  of  these  cyclins.  Furthermore, 
we  analyzed  our  set  of  cell  cycle-regulated  genes  for  known  and  new  promoter  elements 
and  show  that  several  known  elements  (or  variations  thereof)  contain  information 
predictive  of  cell  cycle  regulation.  A  full  description  and  complete  data  sets  are  available 
at  http://cellcycle-www.stanford.edu 


INTRODUCTION 

In  1981  Hereford  and  coworkers  discovered  that  yeast 
histone  mRNAs  oscillate  in  abundance  during  the  cell 
division  cycle  (Hereford  et  al,  1981).  To  date  104  mes¬ 
sages  that  are  cell  cycle  regulated  have  been  identified 
using  traditional  methods,  and  it  was  estimated  that 
some  250  cell  cycle-regulated  genes  might  exist  (Price 
et  al.,  1991).  There  are  several  reasons  why  genes 
might  be  regulated  in  a  periodic  manner  coincident 
with  the  cell  cycle.  Such  regulation  might  be  required 
for  the  proper  functioning  of  mechanisms  that  main¬ 
tain  order  during  cell  division.  Alternatively,  regula¬ 
tion  of  these  genes  could  simply  allow  conservation  of 
resources.  Much  of  the  literature  has  focused  on  the 

A  complete  data  set  for  this  article  is  available  at 

www.molbioIcell.org. 

These  authors  contributed  equally  to  this  work. 

Corresponding  author. 


posttranscriptional  mechanisms  that  control  the  basic 
timing  of  the  cell  cycle.  However,  there  is  also  clear 
eviidence  that  trans-acting  factors  play  a  critical  role  in 
the  regulation  of  the  abundance  of  many  cell  cycle- 
regulated  transcripts. 

Most  identified  cell  cycle  controls  that  exert  influ¬ 
ence  over  mRNA  levels  do  so  at  the  level  of  transcrip¬ 
tion.  Three  major  types  of  cell  cycle  transcription  fac¬ 
tors  are  known  in  yeast,  the  MBF  and  SBF  factors, 
Mcmlp-containing  factors,  and  Swi5p/Ace2p  (Table 
1).  Many  genes  expressed  at  about  the  Gl/S  transition 
contain  MCB  or  SCB  elements  in  their  promoters  to 
which  MBF  and  SBF  bind  respectively  (for  review,  see 
Koch  and  Nasmyth,  1994).  It  is  now  apparent  that  SBF 
is  not  as  specific  for  SCBs  as  was  originally  thought 
but,  rather,  can  bind,  at  least  in  some  cases,  to  motifs 
more  closely  matching  the  MCB  consensus  (Partridge 
et  a/.,  1997).  MBF  and  SBF  are  activated  posttransla- 
tionally  by  Cln3p-Cdc28p,  and  SBF,  at  least,  is  inacti- 
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Table  1.  Transcription  factors  that  regulate  the  cell  cycle 

Complex 

Composition 

Site  name 

Site 

Reference 

SBF 

MBF 

Mcmlp 

SFF 

Ace2p 

SwiSp 

Swi6p  +  Swi4p 
Swi6p  +  Mbplp 

Mcmlp 

SFF 

Ace2p 

SwiSp 

SCB 

MCB 

MCMl 

SFF 

SWI5 

SWI5 

CACGAAA 

ACGCGT 

TTACCNAATTNGGTAA 

GTMAACAA 

ACCAGC 

ACCAGC 

Nasmyth,  1985;  Andrews  and  Herskowitz,  1989 
Lowndes  et  al.,  1991;  McIntosh  et  al.,  1991; 

Koch  et  al.,  1993 

Acton  et  al.,  1997 

Althoefer  et  al.,  1995 

Dohrmann  et  al,  1996 

Knapp  et  al,  1996 

vated  by  Clb2p-Cdc28p  (Amon  et  al,  1993).  It  is  this 
cyclin-dependent  activation  and  inactivation  that 
causes  MBF-  and  SBF-mediated  transcription  to  be  cell 
cycle  regulated.  Mcmlp  can  bind  with  other  DNA 
binding  proteins  to  mediate  a  specific  biological  effect. 
In  cooperation  with  Stel2p,  Mcmlp  directs  the  cell 
cycle  expression  of  some  genes  in  early  G1  phase 
(Oehlen  et  al,  1996).  In  cooperation  with  an  uncloned 
factor  called  "Swi  five  factor"  (SFF),  it  induces  the 
expression  of  CLBl,  CLB2,  BUD4,  and  SWI5  in  M 
(Lydall  et  al,  1991;  Sanders  and  Herskowitz,  1996). 
Finally,  possibly  acting  without  a  partner,  it  induces 
transcription  of  CLN3,  SWI4,  and  CDC6  at  the  M/Gl 
boundary  (Mclnerny  et  al,  1997),  The  Mcmlp  +  SFF 
combination  is  interesting,  because  it  is  somehow  ac¬ 
tivated  by  Clb2p-Cdc28p,  and  Mcmlp  +  SFF  then 
induces  further  transcription  of  CLB2.  Thus,  Mcmlp  is 
part  of  a  positive  feedback  loop  for  CLB2  transcription. 
Finally,  Swi5p  and  Ace2p,  which  are  transcriptionally 
controlled  by  Mcmlp  and  SFF,  are  responsible  for  the 
expression  of  many  genes  in  M  and  M/Gl  (Kovacech 
et  al,  1996),  Some  of  these  genes  are  responsible  for 
inactivating  Clb2p  and  promoting  cytokinesis,  thus 
allowing  exit  from  mitosis,  and  allowing  the  cycle  to 
begin  anew. 

Many  cell  cycle-regulated  genes  are  involved  in 
processes  that  occur  only  once  per  cell  cycle.  Such 
processes  include  DNA  synthesis,  budding,  and  cyto¬ 
kinesis.  Additionally  many  of  these  genes  are  in¬ 
volved  in  controlling  the  cell  cycle  itself,  although  in 
most  cases  it  is  unclear  whether  their  regulated  tran¬ 
scription  is  absolutely  required.  The  cell  division  cycle 
is  thus  a  complex  self-regulating  program,  such  that 


many  genes  involved  in  aspects  of  the  cell  cycle  are 
also  controlled  by  it. 

We  present  the  results  of  a  comprehensive  series  of 
experiments  aimed  at  objectively  identifying  all  pro¬ 
tein-encoding  transcripts  in  the  genome  of  Saccharo- 
myces  cerevisiae  that  are  cell  cycle  regulated.  We  used 
DNA  microarrays  to  analyze  mRNA  levels  in  cell 
cultures  that  had  been  synchronized  by  three  indepen¬ 
dent  methods.  These  data  were  analyzed  by  deriving 
a  numerical  score  based  on  a  Fourier  algorithm  (test¬ 
ing  periodicity)  and  by  a  correlation  function  that 
identified  genes  whose  RNA  levels  were  similar  to  the 
RNA  levels  of  genes  already  known  to  be  regulated  by 
the  cell  cycle.  This  protocol  allowed  us  to  include  data 
from  a  previously  published  study  (Cho  et  al,  1998). 
We  find  that  ~800  genes  are  cell  cycle  regulated, 
which  constitutes  >10%  of  all  protein-coding  genes  in 
the  genome.  We  also  find  that  about  one-half  of  these 
genes  can  be  controlled  by  the  G1  cyclin  CLN3  and/ or 
the  mitotic  cyclin  CLB2.  The  primary  data  presented  in 
this  article,  tools  for  examining  them,  and  supporting 
analyses  can  be  found  at  http://cellcycle-www. 
stanford.edu. 

MATERIALS  AND  METHODS 
Strains 

Strains  used  in  this  study  are  shown  in  Table  2. 

Media  and  Growth  Conditions 

YEP  medium  (Sherman,  1991)  was  used  in  all  experiments,  supple¬ 
mented  with  an  appropriate  carbon  source.  Carbon  sources  are 
indicated  in  the  descriptions  of  each  experiment  and  were  used  at  a 


Table  2.  Strains  used 

in  this  study 

Name 

Genotype 

Source 

DBY7286 

DBY8724 

W303a 

#31 

#245 

cdcl5-2 

MATa  GAL2  ura3 

MATa  GAL2  ura3  barl::URA3 

MATa  ade2-l  trpl-1  leu2-3,112  his3-ll,  15  ura3  canl-100  [psi'^I 

MATa  leu2  ura3  clnl::HlS3  cln2::TRPl  cdc34-2‘^  ura3-GAL-CLN3 

W303a  clblMRAS  clb2::LEU2  clb3::TRPl  clb4::HIS3  GAI-CLB2 

W303a  cdcl5-2‘^ 

This  study 

This  study 

Thomas  et  al,  1989 
Schneider  et  al,  1996 
Fitch  et  al,  1992 

Fitch  et  al,  1992 
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final  concentration  of  2%  (wt/vol),  unless  otherwise  noted.  The  pH 
of  the  YEP  for  the  a  factor  experiment  was  adjusted  to  5.5  before 
use.  The  medium  used  for  the  elutriation  was  first  passed  through 
Whatman  #1  filter  paper.  Cultures  of  yeast  were  shaken  at  250-300 
rpm,  in  a  volume  no  more  than  25%  of  the  vessel  maximum  at  the 
temperature  specified  in  the  description  of  each  experiment. 

Microarray  Manufacture 

Yeast  ORFs  were  amplified  using  gene  PAIRS  primers  (Research 
Genetics,  Huntsville,  AL).  One  hundred-microliter  PCR  reactions 
were  performed  in  96-well  PCR  plates  using  each  primer  pair  with 
the  following  reagents:  1  fxM  each  primer,  200  juM  each  dATP, 
dCTP,  dTTP,  and  dGTP,  1  x  PCR  buffer  (Perkin  Elmer-Cetus,  Nor¬ 
walk,  CT),  2  mM  MgCl2,  and  2  U  of  Taq  DNA  polymerase  (Perkin 
Elmer-Cetus).  Thermalcycling  was  performed  in  Perkin  Elmer-Ce¬ 
tus  9600  thermalcyclers  with  a  5-min  denaturation  step  at  94°C, 
followed  by  30  cycles  with  melting,  annealing,  and  extension  tem¬ 
peratures  and  times  of  94°C,  30  s;  54°C,  45  s;  and  72°C,  3  min  30  s, 
respectively.  Production  of  the  correct  PCR  product  was  verified  by 
gel  electrophoresis.  Products  deemed  to  have  failed  were  reampli¬ 
fied  either  by  repeating  the  PCR  reaction  with  the  gene  PAIRS 
primers,  ordering  custom  primers,  or  using  the  yeast  ORE  DNA 
(Research  Genetics)  as  a  template.  Reamplification  of  failed  PCRs 
used  the  same  protocol  as  initial  amplification. 

DNAs  were  prepared  and  printed  onto  microarrays  as  described 
previously  (Shalon  et  al.,  1996;  DeRisi  et  aL,  1997  [http://cmgm. 
stanford.edu/pbrown/];  Eisen  and  Brown,  1999)  with  190-/im  spac¬ 
ing  between  the  centers  of  each  element.  Each  microarray  was 
visually  inspected,  and  all  microarrays  used  in  this  study  were 
estimated  to  be  missing  <1%  of  all  elements  except  for  arrays  used 
in  the  cdcl5  experiments,  which  were  missing  —3%  of  all  elements. 

Cell  Density  and  Size  Measurements 

All  cell  size  measurements  were  made  using  a  Coulter  Counter 
Multisizer  (Coulter  Electronics,  Hialeah,  FL)  or  a  Beckman  FACScan 
workstation  (Beckman  Instruments,  Fullerton,  CA).  The  Coulter 
Counter  was  also  used  to  measure  the  cell  density  for  elutriation. 
Cell  densities  in  the  a  factor  experiment  were  measured  at  OD^qq 
using  a  Pharmacia  Ultrospec  III  spectrophotometer  (Pharmacia,  Pis- 
cataway,  NJ). 

Budding  Index  Calculations 

Each  sample  was  sonicated  for  30  s  with  a  Virsonic  300  (Virtis, 
Gardiner,  NY)  microprobe  equipped  sonicator  at  50%  power  to 
separate  divided  cells.  At  least  200  cells  were  counted  and  scored  for 
the  presence  of  a  bud. 

DNA  Content  Determination 

Samples  were  prepared  as  described  previously  (Futcher,  1993),  and 
DNA  content  was  measured  using  a  Beckman  FACScan  worksta¬ 
tion. 

Nuclear  Staining 

Cells  were  washed  in  water  and  resuspended  in  water  containing 
DAPI  at  1  jLtg/ml.  Cells  were  then  placed  on  a  glass  slide  and 
visualized  by  fluorescence  microscopy,  using  a  Zeiss  Axioplan  mi¬ 
croscope  (Carl  Zeiss,  Thornwood,  NY). 

OL  Factor-based  Synchronization 

Yeast  strain  DBY8724  was  grown  to  an  OD^qo  of  0.2  in  YEP  glucose, 
an  asynchronous  sample  was  taken,  and  a  factor  (PAN  facility, 
Beckman  Center,  Stanford  University)  was  added  to  a  concentration 
of  12  ng/ml.  After  120  min  the  a  factor  was  removed  by  pelleting 
the  cells  for  5  min  in  a  Sorvall  (Newtown,  CT)  S34  rotor  at  3000  rpm 


and  decanting  the  supernatant.  The  arrested  cells  were  resuspended 
in  fresh  YEP  glucose  to  an  OD^oo  of  0.18.  Every  7  min,  for  the  next 
140  min,  25-ml  samples  were  taken  for  RNA,  and  5-ml  samples  were 
taken  for  FACS  analysis.  At  91  min  after  release  the  OD^oo  of  the 
culture  was  reduced  to  ~0.2  from  —0.4  by  addition  of  fresh  me¬ 
dium. 

Size-based  Synchronization 

Nine  liters  of  yeast  strain  DBY7286  were  grown  in  YEP  ethanol  (2%, 
vol/vol)  at  25°C  to  a  cell  density  of  1.5  X  10^  cells/ml.  Cells  were 
pelleted  in  a  Beckman  JA-10  rotor  for  10  min.  The  supernatant  was 
saved  and  is  referred  to  as  clarified  medium.  Cells  were  resus¬ 
pended  in  300  ml  of  the  clarified  medium  and  sonicated  for  2  min 
with  a  Virsonic  300  equipped  with  a  microprobe  at  50%  power.  This 
volume  was  loaded  into  a  dual-chamber  elutriation  chamber  (Beck¬ 
man  Instruments,  Fullerton,  CA;  catalog  numbers  356940  and 
356941)  in  a  Beckman  J-6  M/E  centrifuge  equipped  with  a  JE-5.0 
elutriation  rotor.  The  elutriator  was  run  with  clarified  medium  at 
25°C.  Unbudded  daughter  cells  (400  ml  at  2.3  X  10^  cells/ml)  were 
collected  at  a  modal  cell  volume  of  17.7  fl  and  grown  at  25°C. 
Samples  were  take  every  30  min  for  the  next  6.5  h  with  independent 
samples  for  DAPI  staining  (1  ml),  FACS  analysis  (2  ml),  budding 
index  (1  ml),  and  RNA  preparation  (25  ml).  After  harvesting,  the 
samples  for  DAPI,  FACS,  and  budding  index  were  immediately 
chilled  on  ice. 

CdclS-based  Synchronization 

The  cdcl5-2  (DBY8728)  strain  was  grown  to  2.5  X  10^  cells /ml  in 
YEP  glucose  medium  at  23°C.  The  culture  was  then  shifted  to  an  air 
incubator  at  37°C  and  held  at  that  temperature  for  3.5  h.  By  this 
time,  cell  density  had  reached  6.6  X  10^'  cells /mi,  and  96%  of  the 
cells  were  large  dumbbells  characteristic  of  a  cdcl5  arrest.  The  cells 
were  then  released  from  the  cdclS  arrest  by  shifting  the  culture  to  a 
23°C  water  bath.  Samples  were  taken  every  10  min  for  300  min, 
starting  at  the  time  of  the  shift  to  the  23°C  water  bath.  By  300  min 
after  shift,  cell  density  had  reached  4  X  10^  cells/ml.  Part  of  the 
same  original  culture  was  grown  at  23°C  to  1  X  10^  cells/ml,  and 
cells  were  harvested  for  extraction  of  the  control  mRNA.  Progress 
through  the  cell  cycle  was  monitored  by  the  appearance  of  new 
buds. 

Because  cdcl5-2  cells  do  not  quantitatively  complete  cell  separa¬ 
tion  after  a  release  from  a  37°C  arrest,  FACS  analysis  is  difficult  to 
interpret.  We  therefore  followed  the  progress  of  the  cdcl5-2  cells 
through  the  cycle  by  monitoring  the  appearance  of  new  buds.  The 
first  new  buds  appeared  50  min  after  the  release  to  23°C,  when  12% 
of  the  dumbbells  had  small  buds  (usually,  two  small  buds,  one  on 
each  half  of  the  dumbbell).  The  percentage  of  dumbbells  with  small 
buds  was  52%  at  60  min,  76%  at  70  min,  96%  at  80  min,  and  virtually 
100%  at  90  min,  at  which  time  almost  all  the  dumbbells  had  not  just 
one  bud,  but  two,  one  on  each  half  of  the  dumbbell.  The  second 
round  of  small  buds  appeared  at  150  min,  when  3%  of  the  cells  had 
small  buds.  The  percentage  was  9.7%  at  160  min,  32%  at  170  min, 
68%  at  180  min,  and  81%  at  190  min.  The  third  round  of  small  buds 
appeared  at  270  min,  although  by  this  time  synchrony  was  decay¬ 
ing. 

Cln3  and  Clb2  Experiments 

For  the  Cln3  experiment  strain  31  (DBY8725)  was  grown  in  YEP 
raffinose/ galactose  (1%  each)  at  23°C  to  a  density  of  1  X  10^  cells/ 
ml.  The  cells  were  then  filtered  and  washed  with  2  vol  of  YEP  and 
resuspended  in  YEP  raffinose  at  23°C.  These  cells  arrested  because 
of  lack  of  Cln  activity  after  incubating  for  3  h.  Cdc34p  was  then 
inactivated  by  shifting  the  culture  to  37°C  for  2.5  h.  The  culture  was 
then  split,  and  galactose  was  added  to  one-half  at  a  final  concentra¬ 
tion  of  2%  (wt/vol).  Cells  from  this  culture  were  harvested  every  10 
min  for  40  min  for  RNA.  The  entire  control  culture  was  harvested  at 
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time  0.  The  experiment  was  performed  twice  (once  for  each  hybrid¬ 
ization  in  our  data  set).  Data  from  the  40-min  (first  experiment)  and 
30-min  (second  experiment)  postgalactose  samples  are  shown. 

For  the  Clb2  experiment  strain  245  (DBY8726)  was  grown  to  a 
density  of  5  X  10^  at  30°C  in  YEP  rafhnose/ galactose  (1%  each)  and 
then  centrifuged,  and  the  cells  were  washed  with  2  vol  of  YEP  and 
then  resuspended  in  YEP  raffinose.  After  6  h  DMSO  was  added  to  a 
final  concentration  of  1%,  and  nocodazole  was  added  to  a  final 
concentration  of  15  jug/ml.  The  culture  was  then  split,  and  galactose 
was  added  to  one-half  at  a  final  concentration  of  2%  (wt/vol).  Cells 
from  this  culture  were  harvested  every  10  min  for  40  min  for  RNA. 
The  entire  control  culture  was  harvested  at  time  0.  The  experiment 
was  performed  twice  (once  for  each  hybridization  in  our  data  set). 
Data  from  the  two  40  min  postgalactose  samples  are  shown. 

To  control  the  Cln3  and  Clb2  experiments  for  the  effects  of  galac¬ 
tose  addition,  strain  W303a  (DBY8727)  was  grown  in  250  ml  of  YEP 
raffinose  at  30°C  to  a  cell  density  of  1  X  10^  cells/ ml.  The  culture 
was  split  in  two,  and  to  one-half  (the  experimental  culture)  was 
added  galactose  to  a  final  concentration  of  2%  (wt/vol).  Forty 
minutes  later  both  cultures  were  harvested  for  preparation  of 
mRNA.  The  data  for  this  experiment  are  available  on  our  web  site. 

RNA  Preparation 

Samples  for  RNA  isolation  were  taken  by  pipetting  culture  directly 
into  50-ml  Falcon  (Lincoln  Park,  NJ)  tubes  containing  —20  g  of  ice  to 
quickly  chill  the  cells.  Cells  were  collected  by  spinning  for  3  min  in 
a  tabletop  centrifuge  and  then  frozen  by  immersion  in  liquid  nitro¬ 
gen  and  stored  at  -80°C  until  RNA  was  prepared.  RNA  was  pre¬ 
pared  by  adding  10  ml  of  water-saturated  phenol,  10  ml  of  sodium 
acetate  buffer  (50  mM  sodium  acetate,  10  mM  EDTA,  pH  5.0),  and  1 
ml  of  10%  SDS  (all  prewarmed  to  65°C)  to  each  frozen  cell  pellet. 
Each  mixture  was  incubated  at  65°C  for  10  min,  vortexing  vigor¬ 
ously  every  1  min  for  10  s.  After  spinning  at  1500  X  g  for  10  min,  the 
aqueous  phase  was  transferred  to  another  50-ml  conical  tube  con¬ 
taining  10  ml  of  water-saturated  phenol.  Samples  were  vortexed  for 
30  s,  and  the  spin  was  repeated.  Aqueous  phases  were  again  trans¬ 
ferred  to  a  new  50-ml  conical  tube  and  10  ml  of  phenohchloroform 
(1:1)  were  added,  followed  by  a  15-min  spin.  RNA  was  precipitated 
by  adding  the  aqueous  phase  to  an  equal  volume  of  isopropanol 
and  0.1  vol  of  3  M  sodium  acetate.  Samples  were  spun  for  30  min  at 
1500  X  ^  to  pellet  the  RNA.  Pellets  were  washed  with  70%  ethanol 
and  dried  at  room  temperature.  RNA  pellets  were  dissolved  in  TE 
(10  mM  Tris,  1  mM  EDTA,  pH  8.0)  to  -2.5  mg/ml. 

Probe  Preparation 

Total  RNA  (15  /xg)  and  6  /xg  of  oligo-dT  were  combined  in  a  total 
volume  of  15  jxl.  RNA  oligo-dT  mixtures  were  heated  to  70°C  for  1 
min  and  then  cooled  on  ice.  Three  microliters  of  25  mM  Cy3-  or 
Cy5-conjugated  dUTP  (Amersham,  Arlington  Heights,  IL),  3  ^1  of  1 
M  DTT,  6  /xl  of  first-strand  buffer  (Stratagene,  La  Jolla,  CA),  0.6  fil  of 
dNTPs  (25  mM  each  dATP,  dCTP,  and  dGTP  and  15  mM  dTTP), 
and  2  /xl  of  Superscript  II  (Stratagene)  were  added.  Each  sample  was 
then  incubated  at  42*^C  for  2  h  to  generate  Cy-labeled  cDNA.  Start¬ 
ing  RNA  was  degraded  by  addition  of  1.5  /xl  of  stop  solution  (1  N 
NaOH,  0.1  M  EDTA)  and  incubation  at  70°C  for  10  min.  Samples 
were  neutralized  by  addition  of  15  jxl  of  0.1  N  HCl  and  400  /xl  of  TE 
(10  mM  Tris,  1  mM  EDTA,  pH  7.4).  Labeled  cDNA  was  concentrated 
and  separated  from  unbound  fluor  by  separation  in  a  Centricon-30 
( Amicon,  Danvers,  M A)  until  no  further  fluor  was  visible  in  the  flow 
through,  and  the  probe  was  concentrated  to  <4  /xl. 

Micro  array  Hybridizations 

A  probe  inixture  (12  /xl)  consisting  of  Cy3-  and  Cy5-labeled  cDNAs,  3X 
SSC,  0.3%  SDS,  and  1.8  /xg//xl  yeast  tRNA  was  applied  to  each  mi¬ 
croarray.  The  microarray  was  covered  by  a  22-mm-square  coverslip 
(Fisher  Scientific,  Pittsburgh,  PA)  and  placed  in  a  custom-manu¬ 


factured  hybridization  chamber  (see  httpry/cmgm.stanford. 
edu/pbrown/).  Ten  microliters  of  water  were  placed  inside  the  hy¬ 
bridization  chamber  before  sealing,  and  the  chamber  was  placed  in  a 
65°C  water  bath.  The  microarrays  were  allowed  to  hybridize  4-6  h. 
Microarrays  were  removed  from  the  chambers  and  placed  in  standard 
histochemistry  slide  holders  where  they  were  washed  by  plimging  30 
times  in  each  of  the  following  solutions,  respectively:  2X  SSC,  0.2% 
SDS;  0.4X  SSC;  and  0.2X  SSC. 

Data  Acquisition  and  Processing 

Microarrays  were  scanned  using  a  custom-built  scanning  laser  mi¬ 
croscope.  Separate  2  X  2-cm  images  were  acquired  for  each  fluor  at 
a  resolution  of  20  /tm /pixel.  Data  were  extracted  by  manually 
superimposing  a  grid  of  boxes  over  the  combined  Cy3-Cy5  image 
so  that  each  box  contained  a  single  array  spot.  The  average  fluores¬ 
cence  intensity  for  each  fluor  within  each  box  was  recorded.  The 
local  background  was  estimated  by  averaging  the  intensities  of  the 
weakest  12%  of  the  pixels  in  each  box.  Fluorescence  ratios  were 
computed  based  on  the  background  corrected  values.  Spots  of  poor 
quality  (as  assayed  by  visual  inspection)  were  removed  from  further 
consideration.  As  a  measure  of  the  internal  consistency  of  the  data 
for  each  spot,  the  pixel-by-pixel  correlation  coefficient  between  the 
Cy3  and  Cy5  intensities  was  computed;  spots  with  low  correlation 
values  (i.e.,  <0.4)  were  excluded  from  further  analysis. 

Identification  of  mRNAs  Regulated  in  a  Cell  Cycle- 
dependent  Manner 

Data  for  each  gene  in  the  a  factor  time  series  were  extracted  from 
the  database  and  were  normalized  so  that  the  average  log2(ratio) 
over  the  course  of  the  experiments  was  equal  to  0.  A  Fourier 
transform  (Eq.  1-3)  was  applied  to  the  data  series  for  each  gene,  and 
the  resulting  vector  (C)  was  stored  for  each  gene,  where  w  is  the 
period  of  the  cell  cycle,  t  is  the  time,  <1^  is  the  phase  offset,  and  ratio(t) 
is  the  ratio  measurement  at  time  t.  We  found  that  the  magnitude  of 
the  Fourier  transform  (D,  Eq.  4)  was  unstable  for  small  variations  of 
a>,  so  we  averaged  the  vectors  of  the  transform  over  a  range  of  40 
values,  which  were  evenly  spaced  around  the  estimated  division  t 
ime  for  the  experiment  (66  ±11).  We  initially  set  the  value  of  ^  to  0. 


A  =  ^sin(ajt  -I-  4>)log2(ratio(t))  (2) 

B  ==  ^cos(a)t  -h  <I>)Iog2(ratio(t))  (2) 

C  =  (A,B)  (3) 

D  -  y/PJTW  (4) 


The  expression  profile  of  each  gene  across  the  experiments  was  then 
correlated  to  five  dijfferent  profiles  representing  genes  known  to  be 
expressed  in  Gl,  S,  G2,  M,  and  M/Gl  using  a  standard  Pearson 
correlation  function.  The  profiles  for  known  gene  classes  were  iden¬ 
tified  by  averaging  the  log2(ratio)  data  for  each  of  the  genes  known 
to  peak  in  each  of  the  five  time  periods.  The  peak  correlation  score 
was  defined  as  the  highest  correlation  value  between  the  data  series 
for  each  gene  and  each  of  the  profiles.  The  vector  calculated  by  the 
Fourier  transform  was  scaled  by  the  peak  correlation  value. 

The  above  process  was  repeated  for  the  cdcl5  experiment  (cu 
varying  between  60  and  80)  and  for  the  cdc28  data  (o)  varying 
between  80  and  100)  from  Cho  et  al.  (1998).  The  cdc28  data  set  was 
first  converted  to  ratio  style  measurements  by  dividing  each  mea¬ 
surement  by  the  average  value  of  the  measurements  for  that  gene. 
Before  this  step  it  was  necessary  to  exclude  some  data  points  that 
appeared  to  be  aberrant.  Any  data  value  where  the  two  values  on 
either  side  were  threefold  different  in  the  same  direction  were 
excluded.  Each  gene  thus  had  three  vector  scores  (one  for  each  of  the 
three  analyzed  data  series). 

To  generate  a  single  vector  for  each  gene,  we  added  the  vectors 
for  each  experiment  together.  However,  the  value  of  <I>  for  the  three 
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experiments  should  not  be  the  same,  because  the  experiments  start 
at  different  points  in  the  cell  cycle.  Therefore,  before  combining  the 
vectors  from  the  three  experiments,  constants,  <lJcdcl5  and  <I)cdc28 
(relative  to  the  a  factor  experiment),  were  calculated  for  the  cdcl5 
and  cdc28  experiments,  respectively,  that  maximized,  for  the  known 
genes,  the  average  magnitude  of  the  summed  vectors.  The  elutria- 
tion  data  were  not  included,  because  it  was  not  possible  to  calculate 
a  ^  that  maximized  the  values  of  more  than  a  handful  of  the  known 
genes.  The  a  factor  and  cdclS  vectors  were  multiplied  by  0.7,  so  that 
they  would  not  unduly  contribute  to  the  final  "aggregate  CDC 
score,"  which  was  calculated  by  taking  the  magnitude  of  this  final 
vector. 

Genes  were  ranked  by  their  aggregate  CDC  scores,  and  the  list 
was  examined  to  identify  the  positions  of  known  cell  cycle  genes 
within  it.  We  selected  a  threshold  CDC  score  that  was  exceeded  by 
91  %  of  known  cell  cycle-regulated  genes.  Altogether  800  genes  met 
or  exceeded  this  CDC  score. 


Promoter  Analysis 

Motifs  were  identified  in  the  700  bp  upstream  of  the  start  codon 
using  a  Gibbs  sampling  strategy.  Such  a  strategy  was  originally 
developed  by  Lawrence  et  al  (1993)  to  find  patterns  in  protein 
sequences  and  later  modified  by  Neuwald  et  al.  (1995)  to  take  into 
account  the  possibility  of  a  repeated  motif.  We  have  modified  these 
Gibbs  sampling  algorithms  to  allow  pattern  searches  of  DNA 
(Zhang,  unpublished  data),  for  which  functions  such  as  double¬ 
strand  search,  palindrome  symmetry,  and  submotif  inclusion  and 
exclusion  were  included.  Once  motifs  were  established  for  a  group 
or  cluster,  we  tested  their  predictive  value  by  searching  for  the  motif 
consensus  (with  specified  mismatches)  in  the  700  bp  upstream  of  the 
ATG  for  all  groups,  as  well  as  for  a  control  set  of  non-cell  cycle- 
regulated  genes,  and  compared  the  distribution  of  these  sites  in 
different  groups. 

TAQman  Assay 

The  TAQman  assay  was  performed  on  the  same  a  factor  samples 
that  were  used  in  the  microarray  hybridization  experiments.  For 
each  sample,  500  ng  of  total  RNA  were  incubated  for  15  min  with  0.1 
U//ll1  DNase  I  (amplification  grade;  Life  Technologies,  Grand  Is¬ 
land,  NY)  in  2  mM  MgCl2,  50  mM  KCI,  20  mM  Tris-HCl  (pH  8.4). 
The  reaction  was  stopped  by  adding  EDTA  to  2.5  mM  and  incubat¬ 
ing  at  65°C  for  10  min.  The  RNA  was  reverse  transcribed  using 
TAQman  reverse  transcription  reagents  (PE  Applied  Biosystems, 
Foster  City,  CA)  consisting  of  2,5  /xM  oligo-dT  16  mer,  1.25  U/jal 
MuItiScribe  reverse  transcriptase,  0.5  mM  dGTP,  dATP,  dTTP,  and 
dCTP,  0.4  U//xI  RNase  inhibitor,  50  mM  KCI,  10  mM  Tris-HCl  (pH 
8.3).  The  reaction  was  incubated  at  25°C  for  10  min,  48°C  for  30  min, 
and  then  95°C  for  5  min.  The  resulting  cDNA  served  as  a  template 
for  real-time  quantitative  PCR  as  follows,  in  which  a  fluorescent 
reporter  dye  (6-carboxy-fiuorescein  [FAM])  was  released  and  quan¬ 
titated  during  each  specific  replication  of  the  template  (Heid  et  al, 
1996).  The  cDNA  was  mixed  with  2x  TAQman  universal  PCR 
master  mix  (PE  Applied  Biosystems)  and  then  split  into  separate 
reaction  tubes  containing  gene-specific  forward  and  reverse  primers 
(900  nM  each)  and  dye-labeled  oligonucleotide  probes  (200  nM). 
Each  resultant  PCR  (25  /xl)  contained  cDNA  generated  from  5  ng  of 
RNA.  The  sequences  of  the  primers  and  probes  were  the  following: 
TUBl  primers:  forward,  5'-AAAGCCGAAGGGAGGAGAAG-3'; 
reverse,  5'-CCCTTGGAACGAACTTACCGT-3';  TUBl  probe: 
5'(FAM)-CTCCACGTTTTTCCATGAAACCGGC-(6-carboxy-tetra- 
methylrhodamine  [TAMRA])p3';  TUBl  primers:  forward,  5'-TT- 
GTCCCATTCCCACGTTTAC-3';  reverse,  5'-GATTGAGAGCCAA- 
TTGCCGT-3';  TUB!  probe:  5'(FAM)-TTCTTCATGGTCGGCTAC- 
GCTCCATT-(TAMRA)p3';  TUB5  primers:  forward,  5'-CCTGCGC- 
CTCAATTGTCTACT-3';  reverse,  5'-TTCCAGGGTGGTATGCGTG- 
3';  TUBS  probe:  5'(FAM)-CGTCGTGGAACCTTACAACACG- 
GTTTTAA-(TAMRA)p3';  PPAl  primers:  forward,  5'-TGTCGGTG- 


CTTCCAATTTGAT-3';  reverse,  5'-CATCGGAAATGGCAGCAGT- 
3';  and  PPAl  probe:  5'(FAM)-CCGGTGATACCGACAGCGAT- 
ACCA-(TAMRA)p3'.  Each  gene-specific  PCR  was  done  in  triplicate 
or  quadruplicate.  The  tubes  were  placed  in  a  PE  Applied  Biosystems 
Prism  7700  sequence  detection  system  and  were  incubated  with  the 
following  parameters:  50°C  for  2  min,  95°C  for  10  min,  followed  by 
40  cycles  of  95°C  for  15  sec  and  60°C  for  1  min.  The  computer 
program  Sequence  Detector  version  1.6.3  (PE  Applied  Biosystems) 
provided  output,  which  allowed  the  average  quantities  of  TUB 
mRNA  relative  to  PPAl  mRNA  to  be  determined  for  each  RNA 
sample.  The  TUB:PPA1  ratio  in  the  asynchronous  sample  Al  was 
arbitrarily  set  at  1,  and  the  results  from  the  other  samples  were 
adjusted  accordingly. 

RESULTS 

Experimental  Overview 

We  wished  to  identify  the  genes  whose  RNA  levels 
varied  periodically  during  the  cell  cycle.  We  initially 
obtained  microarray  data  from  synchronized  cells  and 
suitable  controls  and  analyzed  the  >400,000  measure¬ 
ments  to  obtain  objective  scores  based  on  a  Fourier 
algorithm  (which  assesses  periodicity)  and  a  correla¬ 
tion  measurement  (which  compared  our  data  with 
those  of  previously  identified  cell  cycle-regulated 
genes).  We  compared  scores  among  the  previously 
known  and  total  gene  sets  to  find  a  threshold  value  for 
deciding  the  significance  of  the  apparent  cell  cycle 
regulation.  For  completeness,  we  also  reanalyzed  the 
published  data  of  Cho  et  al  (1998).  Using  all  the  data, 
we  arrived  at  a  threshold  CDC  value  above  which  91% 
(95  of  104)  of  the  genes  previously  shown  to  be  cell 
cycle  regulated  are  included.  This  procedure  identi¬ 
fied  a  total  of  800  yeast  genes  as  being  periodically 
regulated. 

Synchronized  Cultures 

We  measured  the  relative  levels  of  mRNA  as  a  func¬ 
tion  of  time  in  cell  cultures  that  had  been  synchro¬ 
nized  in  three  independent  ways.  First  we  used  a 
pheromone  to  arrest  MATa  cells  in  Gl.  Second,  we 
used  centrifugal  elutriation  to  obtain  small  Gl  cells. 
Finally,  we  used  a  temperature-sensitive  mutation, 
cdcl5-2,  which,  at  the  restrictive  temperature,  arrests 
cells  late  in  mitosis.  We  used  three  methods  because 
each  introduces  characteristic  artifacts.  For  instance, 
use  of  pheromone  has  regulatory  consequences  char¬ 
acteristic  of  mating,  whereas  use  of  temperature-sen¬ 
sitive  mutants  can  cause  heat  shock. 

The  synchronization  experiments  differed  in  major 
ways.  First,  they  were  performed  using  different  car¬ 
bon  sources  and  at  different  temperatures,  with  the 
consequence  that  the  cells  grew  at  different  rates.  Sec¬ 
ond,  two  different  yeast  strain  backgrounds  were  used 
(S288C  and  W303),  and  finally,  cells  were  synchro¬ 
nized  at  different  points  during  the  cell  cycle.  Each 
method  produced  significant  cell  cycle  synchrony 
through  one  cell  cycle  (elutriation),  two  cycles  (a  pher- 
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omone),  or  three  cycles  (cdclS),  as  established  by  at 
least  one  of  the  following  methods  for  each  experi¬ 
ment:  bud  count,  DNA  content  analysis  (FACS)  and 
nuclear  staining  (DAPI),  as  described  in  MATERIALS 
AND  METHODS. 

RNA  was  extracted  from  each  of  the  samples  col¬ 
lected,  as  well  as  from  a  control  sample  (asynchronous 
cultures  of  the  same  cells  growing  exponentially  at  the 
same  temperature  in  the  same  medium).  Eluorescently 
labeled  cDNA  was  synthesized  using  Cy3  ("green") 
for  all  controls  and  Cy5  ("red")  for  all  experimental 
samples.  Mixtures  of  labeled  control  and  experimental 
cDNAs  were  competitively  hybridized  to  individual 
microarrays  containing  essentially  all  yeast  genes  (De- 
Risi  et  al.,  1997).  The  ratio  of  experimental  (red)  to 
control  (green)  cDNA  was  measured  by  scanning  laser 
microscopy  (Shalon  et  al.,  1996). 


Transcription  in  Response  to  the  Cyclins  ClnSp  and 
Clb2p 

To  gain  mechanistic  insight  into  the  control  of  the  ob¬ 
served  ceU  cycle  regulation,  we  identified  genes  whose 
mRNA  levels  responded  to  the  induction  of  two  well- 
characterized  cell  cycle  regulators,  ClnSp  and  Clb2p  (see 
Nasmyth,  1993).  Late  in  G1  phase,  the  Cln3p-Cdc28p 
protein  kinase  complex  activates  two  transcription  fac¬ 
tors,  MBE  and  SBF,  and  these  in  turn  promote  the  tran¬ 
scription  of  a  number  of  genes  important  for  budding 
and  DNA  synthesis  (Cross,  1995).  Later  in  the  cell  cycle, 
the  Clb2p-Cdc28p  complex  represses  the  activity  of  SBF, 
returning  the  expression  of  SBF-regulated  genes  to  low 
levels  (Amon  et  al,  1993).  Furthermore,  Clb2p-Cdc28p  is 
known  to  activate  expression  of  at  least  four  genes, 
CLBl,  CLB2,  SWI5,  and  BUD4  (Althoefer  et  al,  1995; 
Sanders  and  Herskowitz,  1996). 

To  identify  other  genes  controlled  by  Cln3p  and 
Clb2p,  we  arrested  cln~  or  clb~  cells  in  late  G1  with 
cdc34-2  for  the  CLN3  experiment  and  in  M  with  no- 
codazole  for  the  CLB2  experiment.  We  then  induced 
expression  of  CLN3  or  CLB2  without  inducing  cell 
cycle  progression.  RNA  from  the  Gl-phase  cells  ex¬ 
pressing  Cln3p  (labeled  red)  was  compared  with  con¬ 
trol  RNA  (labeled  green)  from  the  Gl-phase  cells  ar¬ 
rested  in  the  absence  of  Cln3p.  Similarly,  for  the  CLB2 
experiment,  RNA  from  M-phase  cells  expressing 
Clb2p  (labeled  red)  was  compared  with  control  RNA 
(labeled  green)  from  M-phase  cells  arrested  in  the 
absence  of  Clb2p.  In  each  case,  mRNA  levels  were 
quantitatively  measured  by  microarray  hybridization. 
In  addition,  we  performed  an  experiment  to  test  the 
effects  of  galactose  to  an  asynchronous  culture  with  no 
inducible  cychn  (see  MATERIALS  AND  METHODS). 
Genes  identified  as  strongly  affected  by  galactose  ad¬ 
dition  were  not  considered  further  in  the  Gal  cyclin 
experiments. 


Data  Analysis  and  Availability 
The  total  data  we  collected  comprise  —400,000  indi¬ 
vidual  ratio  measurements.  The  quality  and  reliability 
of  the  data  can  only  be  assessed  by  unrestricted  access 
to  all  data  in  forms  suitable  for  further  query  or  com¬ 
puter  analysis.  Therefore,  in  addition  to  the  summary 
printed  here,  we  provide  primary  data  from  two  loca¬ 
tions  on  the  Internet.  The  numerical  data  are  provided 
in  a  table  of  the  actual  ratios  measured  for  each  gene, 
on  each  array.  They  can  be  downloaded  as  a  tab- 
delimited  text  file  from  the  journal  web  site  (http:// 
www.molbiolcell.org)  or  from  a  server  at  Stanford 
(http://cellcycle-www.stanford.edu).  The  Stanford 
web  site  also  provides  images  of  the  arrays,  accessory 
data,  and  the  capability  to  browse  and  search  the 
complete  data  set.  Raw  data  are  also  available  from 
the  authors  upon  request. 

The  comprehensive  nature  of  this  work  has  another 
consequence:  in  what  follows  we  refer  by  name  to  as 
many  as  400  genes.  It  is  impractical  to  provide  detailed 
Hterature  documentation  for  each  gene  every  time  it 
appears.  Instead,  we  have  provided  references  selec¬ 
tively,  and  we  encourage  readers  to  use  the  hyperlinks  to 
the  ^ccharomyces  Genome  Database  (http://genome- 
www.stanford.edu/Saccharomyces)  and  the  Yeast  Protein 
Database  0ittp://quest7.proteome.com/YPDhome.html) 
that  will  be  provided  at  both  the  Molecular  Biology  of  the  Cell 
and  Stanford  web  sites. 

Identification  of  Cell  Cycle-regulated  Transcripts 
Combining  the  data  from  the  synchronization  experi¬ 
ments,  we  were  able  to  identify  800  genes  whose  expres¬ 
sion  is  ceU  cycle  regulated.  We  did  this  by  the  combina¬ 
tion  of  a  Former  algorithm  and  a  correlation  algorithm  as 
described  in  MATERIALS  AND  METHODS.  This  re¬ 
sulted  in  a  score  for  each  gene  that  we  refer  to  as  the 
aggregate  GDC  score.  To  illustrate  this.  Table  3  provides 
some  summary  statistics  and  examples  of  the  kinds  of 
scores  obtained  for  several  genes  (including  specific  ex¬ 
amples  that  are  and  are  not  ceU  cycle  regulated). 

In  setting  the  threshold  for  the  aggregate  GDC  score 
by  our  empirical  method,  we  intended  to  minimize 
false-positive  assessments  while  including  the  vast 
majority  of  previously  characterized  genes  that  are 
known  to  have  periodic  mRNA  levels.  Many  addi¬ 
tional  genes  showed  indications  of  cell  cycle  regula¬ 
tion  (by  visual  inspection  of  the  data  and  by  quanti¬ 
tation  using  our  algorithm),  although  we  could  not 
objectively  distinguish  this  behavior  from  noise. 

We  estimated  the  false-positive  rate  in  two  ways. 
First,  we  randomized  the  data  from  each  experiment 
(both  by  gene  and  by  time  point)  and  performed  all  of 
the  analyses  described  above.  The  randomized  data 
produced  24  "genes"  (of  nearly  6200)  with  GDC  scores 
that  exceed  the  threshold  we  used  to  classify  genes  as 
cell  cycle  regulated.  We  assume  that  this  represents  a 
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Table  3. 

Example  scores  and  statistics  for  a  collection  of  genes 

Rank  by 
score 

Score  gene  name 

Peak  to  trough 

1 

15.99  PlRl 

27.3 

9 

10.90  CLN2 

12.1 

37 

8.78  CLBl 

9.4 

82 

6.51  BUD9 

7.0 

177 

4.25  STE3 

12.8 

224 

3.55  TUB4 

4.8 

255 

3.29  DUNl 

4.2 

401 

2.37  CINS 

5.4 

407 

2.33  TUB2 

5.5 

585 

1.71  METl 

3.0 

800 

1.314  STP4 

5.9 

800 

1.314  Threshold  for  inclusion 

844 

1.28  SECS 

4.2 

861 

1.25  TUBl 

2.7 

1258 

0.92  ANPl 

3.1 

1799 

0.71  TUPl 

3.0 

2499 

0.54  TUBS 

2.7 

2673 

0.50  IME2 

3.5 

6054 

0.05  RPS8B 

10.9 

reasonable  estimate  of  the  false-positive  rate  (i.e.,  ~3% 
of  all  genes  identified  would  be  false  positives).  In  a 
second,  more  conservative  test,  we  randomized  the 
data  set  only  within  genes.  The  number  of  genes  that 
had  scores  above  our  threshold  was  about  three  times 
higher  (75  genes)  when  we  randomly  shuffled  the  data 
in  this  way.  Thus,  the  number  of  false  positives  (of  the 
800  genes  identified  as  cell  cycle  regulated)  is  likely 
<10%  and  perhaps  as  low  as  3%. 

Classifying  the  Cell  Cycle-regulated  Genes  by 
Pattern  of  Expression 

We  used  two  distinct  methods  to  classify  genes  by 
their  pattern  of  expression,  which  we  refer  to  as  '"phas- 
ing'"  (by  time  of  peak  expression)  and  "clustering"  (by 
similarity  of  expression  across  the  experiments,  which 
is  described  below).  There  is  no  simple  relationship 
between  these  two  methods,  although  there  are  com¬ 
mon  features  in  the  results.  "Phase  groups"  were  cre¬ 
ated  by  determining  the  time  of  peak  expression  for 
each  gene  (calculated  from  the  Fourier  algorithm)  and 
ordering  all  genes  by  this  time.  We  divided  this  or¬ 
dered  set  into  five  (somewhat  arbitrary)  groups 
termed  Gl,  S,  G2,  M,  and  M/Gl  that  approximate 
those  commonly  used  in  the  literature.  To  this  end  we 
used  the  published  timing  of  gene  expression  for  the 
known  genes  in  determining  which  genes  belonged  in 
which  phase  group.  Figure  lA  displays  the  800  genes 
that  we  identified,  sorted  according  to  the  phase  of 
expression.  Each  column  represents  a  time  point  in  an 
experiment,  and  each  row  represents  a  gene  that  we 
identified  as  cell  cycle  regulated.  The  ratio  of  expres¬ 
sion  that  we  measured  for  each  gene  in  each  time 


point  is  color  coded,  reflecting  the  magnitude  of  the 
ratio  of  expression  relative  to  the  average  of  that  gene, 
with  shades  of  red  indicating  an  increase  (on)  and  shades  of 
green  indicating  a  decrease  (off).  This  display  is  based  on 
the  paradigm  of  Eisen  et  al  (1998).  Genes  expressed  during 
each  part  of  the  ceU  cycle  are  indicated  by  the  color  bar  (and 
phase)  on  the  side,  and  temporal  progress  through  the  cell 
cycle  is  indicated  on  the  top. 

By  phasing  there  were  300  Gl  genes  (e.g.,  CLN2, 
RNRl,  CDC9,  RAD27,  SMC3,  and  MNNl),  71  S  genes 
(e.g.,  the  histones),  121  G2  genes  (e.g.,  CLB4,  WH/3, 
and  aS3),  195  M  genes  (e.g.,  DBF2,  CLB2,  CDC5, 
CDC20,  and  SW5),  and  113  M/Gl  genes  (e.g.,  ASHl, 
SlCl,  CDC6,  and  EGT2),  This  is  a  crude  classification 
with  many  disadvantages  (e.g.,  the  last  gene  in  the  G2 
group  and  the  first  gene  in  the  M  group  are  expressed 
at  virtually  the  same  time  yet  are  in  different  groups), 
but  nevertheless  it  is  useful  for  discussing  the  results. 

Identification  of  DNA  Binding  Sites 

We  searched  through  the  700  bp  immediately  up¬ 
stream  of  the  start  codon  of  each  of  the  800  genes  in 
our  list  to  identify  potential  binding  sites  for  known  or 
novel  factors  that  might  control  expression  during  the 
cell  cycle.  We  found  that  the  majority  of  the  genes 
have  good  matches  to  known  cell  cycle  transcription 
factor  binding  sites  relevant  to  the  time  of  peak  ex¬ 
pression.  Furthermore,  we  examined  the  distribution 
of  these  elements  within  the  upstream  sequences,  and 
found  that  both  the  site  and  its  position  relative  to  the 
ATG  contain  information  that  is  predictive  of  the 
phase  group  of  the  gene.  Figure  2  shows  the  frequency 
of  six  sites  in  promoters  of  the  Gl,  S,  G2,  M,  and  M/Gl 
phase  groups  and  a  control  set  of  non-cell  cycle-regu¬ 
lated  genes.  These  sites  are  the  previously  published 
SCB  and  MCB  as  well  as  four  extensions  and  modifi¬ 
cations  of  published  sites  (MCMl  +  SFF,  extended 
SWI5,  SCB  variant,  and  degenerate  MCB).  Full  results 
of  all  promoter  searches  are  available  on  our  web  site. 

Clusters  and  Their  Regulation 

Clusters  were  established  using  the  clustering  algo¬ 
rithm  of  Eisen  et  al  (1999).  This  algorithm  sorts 
through  all  the  data  to  find  the  pairs  of  genes  that 
behave  most  similarly  in  each  experiment  and  then 
progressively  adds  other  genes  to  the  initial  pairs  to 
form  clusters  of  apparently  coregulated  genes.  As  will 
be  discussed  below,  the  clustering  algorithm  success¬ 
fully  identifies  coregulated  genes,  because  analysis  of 
the  5'  regions  of  the  genes  in  a  cluster  shows  that  such 
genes  share  common  promoter  elements,  many  of 
which  are  identifiable  based  on  the  published  litera¬ 
ture.  Thus,  these  clusters  provide  a  foundation  for 
understanding  the  transcriptional  mechanisms  of  cell 
cycle  regulation.  Figure  IB  shows  the  entire  cluster- 
gram  of  our  cell  cycle-regulated  genes;  a  larger  ver- 
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Figure  1.  (cont). 
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Figure  2.  Binding  site  frequencies.  The  distribution  of  various  promoter  elements  in  the  upstream  regions  of  each  the  five  cell  cycle- 
regulated  groups  and  a  control  group  of  279  non-cell  cycle-regulated  genes  that  do  not  respond  to  either  Cln3p  or  Clb2p  induction  are 
graphically  displayed.  In  each  case  the  numbers  on  the  x  axes  represent  distance  from  the  start  codon,  and  the  bars  represent  the  frequency 
of  a  particular  site  per  residue  per  gene  at  that  position  in  the  upstream  promoter  regions.  N  is  any  base;  Y  is  C  or  T;  W  is  A  or  T;  R  is  A 
or  G;  and  M  is  A  or  C.  (A)  SCB.  (B)  A  variant  on  the  SCB,  which  is  also  similar  to  an  MCB.  (C)  MCB.  (D)  MCB_d,  a  degenerate  MCB  sequence. 
(E)  MCMl  /SEE  site.  (E)  Swi5e,  an  extended  Swi5  site. 


sion  with  gene  names  attached  is  available  at  our  web 
site.  The  same  color-coded  presentation  is  used,  with 
the  addition,  on  the  extreme  left,  of  the  similarity  tree 
(dendrogram)  calculated  by  the  clustering  algorithm. 
Many  portions  of  the  clustergram  (subclusters)  are 


Figure  1  (cont).  Gene  expression  during  the  yeast  cell  cycle.  Genes 
correspond  to  the  rows,  and  the  time  points  of  each  experiment  are 
the  columns.  The  ratio  of  induction /repression  is  shown  for  each 
gene  such  that  the  magnitude  is  indicated  by  the  intensity  of  the 
colors  displayed.  If  the  color  is  black,  then  the  ratio  of  control  to 
experimental  cDNA  is  equal  to  1,  whereas  the  brightest  colors  (red 
and  green)  represent  a  ratio  of  2.8:1.  Ratios  >2.8  are  displayed  as  the 
brightest  color.  In  all  cases  red  indicates  an  increase  in  mRNA 
abundance,  whereas  green  indicates  a  decrease  in  abundance.  Gray 
areas  (when  visible)  indicate  absent  data  or  data  of  low  quality. 
Color  bars  on  the  right  indicate  the  phase  group  to  which  a  gene 
belongs  (M/Gl,  yellow;  Gl,  green;  S,  purple;  G2,  red;  M,  orange). 
These  same  colors  indicate  cell  cycle  phase  along  the  top.  (A)  Gene 
expression  patterns  for  cell  cycle-regulated  genes.  The  800  genes  are 
ordered  by  the  times  at  which  they  reach  peak  expression.  (B)  Genes 
that  share  similar  expression  profiles  are  grouped  by  a  clustering 
algorithm  as  described  in  the  text.  The  dendrogram  on  the  left 
shows  the  structure  of  the  cluster. 


described  below,  and  those  that  we  discuss  are  sum¬ 
marized  in  Table  4.  The  locations  of  these  subclusters 
in  the  main  cluster  are  indicated  on  Figure  IB. 


Table  4.  Cluster  summary 

No.  of  Peak 


Cluster 

genes 

Binding  site 

Regulator 

expression 

CLN2 

119 

ACGCGT 

MBE,  SBE 

Gl 

Y' 

26 

Unknown 

Unknown 

Gl 

EKSl 

92 

ACRMSAAA 

SBE,  (MBE?) 

Gl 

Histone 

10 

ATGCGAAR 

Unknown 

S 

MET 

20 

AAACTGTGG 

Met31p,  Met32p 

S 

CLB2 

35 

MCMl  +  SEE 

Mcmlp  SEE 

M 

MCM 

34 

MCMl 

Mcmlp 

M/Gl 

SICl 

27 

RRCCAGCR 

Swi5p/Ace2p 

M/Gl 

Total 

363 

The  FKSl  set  of  genes  is  not  a  cluster,  and  the  CLN2  cluster  includes 
genes  outside  the  core  cluster  that  is  shown.  Consensus  binding 
sites  defined  in  the  literature  for  various  factors  are  as  follows: 
MCMl,  TTACCNAATTNGGTAA;  SEE,  GTMAACAA. 
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Figure  3.  The  G1  clusters.  The  transcription  profiles  are  displayed  as  described  in  the  legend  to  Figure  1.  (A)  CLN2  cluster.  A  fraction  of  the 
genes  regulated  similarly  to  the  G1  cyclin  CLN2,  which  reaches  peak  expression  in  the  G1  phase  of  the  cell  cycle.  To  view  the  full  cluster  of 
all  the  cell  cycle-regulated  genes,  visit  http:/' cellcycle-www.stanford.edu.  (B)  Y'  cluster.  Thirty-one  genes  that  are  located  within  the  Y' 
elements  show  cell  cycle  regulation  of  mRNA  levels  that  peak  in  Gl. 


The  Gl  Clusters 

The  "CLN2''  cluster  is  the  largest  subcluster  and 
contains  76  genes.  Genes  in  this  cluster  include 
CLNl,  CLN2,  CLB6,  RNRl,  CDC9,  CDC21,  CDC45, 
P0L12,  POL30,  SWEl,  and  many  other  genes  in¬ 
volved  in  DNA  replication.  A  portion  of  this  cluster 


is  shown  in  Figure  3A.  The  key  features  of  these 
genes  are  that  expression  is  strongly  cell  cycle  reg¬ 
ulated  (i.e.,  large  peak-to-trough  ratios);  peak  ex¬ 
pression  occurs  in  mid-Gl  phase  ('^10  min  before 
budding  in  the  cdcl5  experiment);  and  they  are 
strongly  induced  by  GAL-CLN3  but  are  strongly 
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repressed  by  GAL-CLB2.  Fifty-eight  percent  of  the  5' 
regions  of  these  genes  had  at  least  one  copy  of  the 
motif  ACGCGT  (vs.  6%  of  control  genes),  which  is  a 
perfect  MCB  element.  Fifty-two  percent  had  at  least 
one  copy  of  CRCGAAA  (vs.  13%  of  control  genes),  a 
degenerate  SCB  element.  In  addition,  16  had  the 
motif  AGAAGAAA,  which  is  similar  to  a  function¬ 
ally  important  sequence  found  upstream  of  CLN3 
(AAGAAAAA)  (Parviz  et  al.,  1998).  Finally,  17  had 
the  motif  CCACAK,  which  we  do  not  recognize. 
Outside  the  core  of  this  cluster  are  at  least  43  addi¬ 
tional  genes  that  are  less  tightly  clustered  but  nev¬ 
ertheless  appear  to  be  coregulated  with  the  CLN2 
cluster  (119  total  genes). 

The  "Y"'  cluster  (Figure  3B)  contains  31  ORFs  that 
all  share  DNA  sequence  similarity.  There  are  38  ORFs 
that  share  this  similarity  in  the  genome  and  we  iden¬ 
tify  36  of  them  as  cell  cycle  regulated.  All  of  these  38 
ORFs  are  found  in  Y'  elements,  located  at  chromo¬ 
somes  ends.  It  should  be  noted  that  these  results  may 
not  represent  36  independent  observations,  because 
the  cDNAs  corresponding  to  these  ORFs  are  almost 
certain  to  cross-hybridize  on  the  microarrays.  We  do 
not  know  how  these  ORFs  are  regulated  or  the  func¬ 
tional  significance. 

There  is  a  set  of  92  genes,  containing  ALG7,  FKSl, 
GASl,  GOGS,  PMTl,  and  PMI40,  as  well  as  other 
genes  involved  in  cell  wall  synthesis  (Klis,  1994), 
that  are  not  a  cluster  on  the  clustergram  but  that  are 
substantially  coregulated.  These  genes  can  be  seen 
on  our  web  site  as  Figure  3C.  Expression  is  strongly 
cell  cycle  regulated,  and  peak  expression  is  nearly 
coincident  with  budding  (~10  min  later  than  the 
CLN2  cluster  in  the  cdclS  experiment).  These  genes 
are  induced  by  GAL-CLN3  and  repressed  by  GAL- 
CLB2.  The  majority  of  these  genes  had  the  motif 
ACRMSAAA  (where  R  is  A  or  G,  M  is  A  or  C,  and 
S  is  C  or  G),  which  may  be  an  extension  and  varia¬ 
tion  of  the  SCB  motif  (CACGAAA).  Comparison  of 
the  CLN2  cluster  with  this  set  suggests  that  expres¬ 
sion  from  MCB  motifs  may  be  activated  somewhat 
before  expression  from  SCB  motifs,  but  both  kinds 
of  expression  are  induced  by  CLN3  (consistent  with 
previous  studies)  and  repressed  by  CLB2.  Earlier 
studies  demonstrated  that  repression  of  SCB-driven 
expression  requires  CLB2,  whereas  repression  of 
MCB-driven  expression  did  not  (Amon  et  al.,  1993). 
Our  results  extend  this  by  showing  that  CLB2  can 
repress  MCB-driven  expression,  even  though  there 
may  be  additional  repressive  mechanisms.  Many  of 
the  genes  in  this  set  also  had  the  motif 
AARAARAAG,  which  is  similar  to  a  motif  found  in 
the  CLN2  cluster  (see  above).  However,  because 
promoters  generally  are  rich  in  such  sequences,  the 
significance  of  this  motif  is  unclear. 


The  S  and  M  Clusters 

The  histone  cluster  in  Figure  4A  forms  the  tightest 
cluster  of  any  of  the  cell  cycle  genes.  These  nine  genes 
have  very  high  peak-to-trough  ratios  and  give  aggre¬ 
gate  scores  of  —10.  The  histones  have  three  known 
modes  of  regulation:  first,  there  are  negative  elements 
repressing  transcription;  second,  there  is  an  element  in 
the  3'  region  of  the  mRNAs  that  destabilizes  the  mes¬ 
sage  except  during  S  phase;  and  third,  there  is  a  re¬ 
peated  positive  element,  which  activates  transcription 
(Freeman  et  al,  1992).  Part  of  the  core  motif  of  the 
positive  element  is  ATGCGAAR,  which  is  similar  to 
our  degenerate  SCB  motif  (ACRMSAAA).  Consistent 
with  this,  histone  expression  is  induced  by  GAL-CLN3. 
However  it  has  been  shown  that  the  level  and  period¬ 
icity  of  HTA2/HTB2  mRNA  accumulation  are  not  no¬ 
ticeably  affected  by  single  mutation  of  SW74,  SWI6,  or 
MBPl  (Lowndes  et  al,  1992;  Cross  et  al,  1994).  Addi¬ 
tionally,  histone  levels  are  unaffected  by  GAL-CLB2. 
The  sharpness  of  the  peak  in  histone  regulation  is 
worth  noting,  both  because  it  gives  a  good  impression 
of  the  degree  of  synchronization  and  because  the  his¬ 
tones  were  the  first  genes  for  which  periodic  regula¬ 
tion  was  discovered  (Hereford  et  al,  1981). 

The  "MET"  cluster  (20  genes,  Eigure  4B)  was  com¬ 
pletely  unexpected.  It  contains  10  genes  involved  in 
the  biosynthesis  of  methionine.  Furthermore,  two  of 
the  unnamed  genes  in  this  cluster  show  sequence  sim¬ 
ilarity  to  human  methionine  synthetase,  two  are  likely 
to  be  amino  acid  transporters  (with  unidentified  spec¬ 
ificities),  one  is  similar  to  MET17,  and  one  is  on  the 
opposite  strand  of  MET2.  Finally,  ECM17,  the  only 
previously  characterized  gene  in  the  cluster  that  is  not 
known  to  be  part  of  the  methionine  biosynthetic  path¬ 
way,  is  similar  to  a  sulfite  redoxin  from  human.  Thus, 
nearly  all  of  the  genes  in  this  cluster  are  likely  to  be 
involved  in  methionine  metabolism.  Expression  of  the 
genes  in  this  cluster  peaks  just  after  the  histones,  and 
at  least  some  are  inducible  by  CLN3.  We  searched  the 
upstream  region  of  the  genes  in  the  MET  cluster  and 
found  that  15  of  the  genes  had  the  consensus  AAACT- 
GTGG,  which  is  identical  to  the  consensus  found  for 
Met31/Met32  binding  (Blaiseau  et  al,  1997). 

The  "CLB2"  cluster  (Figure  4C)  contains  35  genes 
and  includes  many  genes  involved  in  mitosis  such  as 
CLB2,  CDC5,  CDC20,  and  SWI5.  There  are  also  many 
other  less  tightly  clustered  genes  that  appear  to  be 
regulated  in  a  similar  manner,  including  WSC4,  PMPl, 
and  the  major  plasma  membrane  proton  pumps 
PMAl  and  PMA2.  The  CLB2  cluster  is  highly  regu¬ 
lated  with  a  peak  in  M,  and  the  genes  are  very 
strongly  induced  by  GAL-CLB2,  whereas  GAL-CLN3 
appears  somewhat  repressive.  It  was  previously 
known  that  four  of  the  genes  found  in  this  cluster, 
CLBl,  CLB2,  SWI5,  and  BUD4,  are  regulated  by  a 
combination  of  two  transcription  factors,  Mcmlp  and 
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Figure  4.  The  S  and  M  clusters.  The  transcription  profiles  are  displayed  as  described  in  the  legend  to  Figure  1.  (A)  Histone  cluster.  The  eight 
genes  encoding  histones  and  the  yeast  histone  HI  homologue  cluster  very  tightly  and  are  expressed  during  S  phase  of  the  yeast  cell  cycle. 
(B)  MET  cluster.  The  expression  of  many  of  the  members  of  the  methionine  pathway  peaks  just  after  the  histones.  (C)  CLBl  cluster.  A 
subcluster  of  genes  that  are  expressed  similarly  to  CLBl  highlights  genes  that  peak  during  M  phase. 


SFF  (Althoefer  et  al.,  1995;  Sanders  and  Herskowitz, 
1996).  Mcmlp  binds  to  the  consensus  TTACCNAAT- 
TNGGTAA  (Acton  ei  a\.,  1997),  whereas,  on  the  basis 
of  three  of  these  genes,  SFF  was  thought  to  bind  to  the 
consensus  sequence  GTMAACAA.  Furthermore,  tran¬ 
scription  of  CLBl,  CLBl,  and  SW75  was  known  to  be 
induced  by  Clb2p  activity,  possibly  because  of  post- 
translational  activation  of  SFF  (Amon  et  al,  1993). 
We  compared  the  upstream  regions  of  genes  in  the 
CLBl  cluster  and  certain  other  coregulated  genes 
(e.g.,  ASEl,  also  thought  to  be  a  possible  target  of 
SFF  [Pellman  et  al.,  1995])  and  found  that  most  of 
them  contain  an  easily  recognizable  MCMl  +  SFF 
motif.  Of  the  35  genes  in  the  cluster,  only  9  genes 
(KIPl,  MOBl,  NUMl,  YCLOllW,  BUD3,  CHAl, 
YCL063W,  YLR057W,  and  YML033W)  did  not  have 
an  easily  recognizable  near  match  to  the  MCMl  + 


SFF  consensus.  An  alignment  of  the  genes  that  con¬ 
tained  this  site  can  be  viewed  on  our  web  site,  and 
on  the  basis  of  this  alignment,  we  deduce  a  new 
consensus  for  MCMl  +  SFF  binding,  shown  in  Fig¬ 
ure  5. 


The  M/Gl  Clusters 

The  "MCM"  cluster  (Figure  6A)  contains  34  genes,  in¬ 
cluding  all  six  MCM  genes  that  are  directly  involved  in 
DNA  repUcation  (MCM2,  MCM3,  CDC54,  CDC46, 
MCM6,  and  CDC47;  reviewed  by  Chevalier  and  Blow, 
1996)  as  well  as  FARl,  DBFl,  SPOll,  and  KIN3.  These 
genes  peak  late  in  the  cycle,  at  about  the  M/Gl  bound¬ 
ary,  and  are  induced  by  CLBl  and  somewhat  repressed 
by  CLN3.  This  cluster  has  similarities  to  the  CLBl  cluster, 
except  that  peak  expression  is  slightly  later.  Searches  of 
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Figure  5  The  MCMl  +  SFF  consensus.  By  aligning  promoter  elements  of  several  coregulated  genes  found  m  the  CLB2  cluster  (see  our  web 
site  for  the  alignment),  we  developed  a  matrix  for  a  new  MCMl  +  SFF  consensus.  The  number  of  times  each  base  was  found  at  each  position 
in  the  site  was  tallied  and  is  displayed.  The  consensus  was  determined  by  examining  the  nucleotide  frequencies  at  each  position. 


the  upstream  regions  reveal  that  the  majority  of  these 
genes  contain  binding  sites  for  Mcmlp,  as  was  previ¬ 
ously  shown  for  some  members  of  the  cluster  (Mclnemy 
et  al,  1997).  Some,  but  not  all,  of  these  MCMl  sites  have 
nearby  sites  for  SFF  (e.g.,  in  FARl,  SP012,  KINS,  and 
CDC47),  although  these  presumptive  SFF  sites  are  of 
varying  quality.  It  has  been  suggested  that  some  of 
the  genes  in  this  cluster  are  regulated  through  the 
"ECB,"  a  variant  of  the  Mcmlp  binding  site  (Mcln- 
erny  et  al,  1997). 

The  "SICl"  cluster  comprises  27  genes,  including 
EGT2,  PCL9,  TECl,  ASHl,  SICl,  and  CTSl.  These 
genes  are  strongly  cell  cycle  regulated  (Figure  6B)  and 
peak  in  late  M  or  at  the  M/Gl  boundary.  GAL-CLN3 
may  repress  some  of  these  genes,  whereas  GAL-CLB2 
has  no  consistent  effect  on  the  expression  of  these 
genes.  Several  of  these  genes  are  known  to  be  regu¬ 
lated  by  the  transcription  factor  Swi5p,  which  itself  is 
a  member  of  the  CLB2  cluster  (Dohrmann  et  al,  1992; 
Bobola  et  al,  1996;  Knapp  et  al,  1996).  SwiSp  is  thought 
to  bind  to  a  site  with  the  consensus  ACCAGC  (Knapp 
et  al,  1996),  and  indeed,  when  we  searched  for  com¬ 
mon  motifs  in  the  5'  regions  of  the  SICl  cluster,  we 
found  the  consensus  RRCCAGCR  in  many  of  the  27 
genes.  When  all  cell  cycle-regulated  genes  were  ex¬ 
amined  for  the  presence  of  either  the  original  SwiSp 
consensus,  or  this  new  extended  consensus,  the  ex¬ 
tended  consensus  was  found  to  be  much  more  specific 
for  late  M-phase  genes.  This  comparison  is  shown  on 
our  web  site.  The  motif  GCSCRGC  was  also  found  in 
~40%  of  the  genes  in  this  cluster. 

The  "MAT"  cluster  contains  13  genes  and  is  shown 
in  Figure  6C.  Some  of  these  genes  (MFal,  MFa2,  and 
STE3)  are  specific  for  MATa  cells  (Jarvis  et  al,  1988) 
and  so  are  significantly  expressed  only  in  the  cdcl5 
experiment,  which  was  done  with  a  MATa  strain. 
Other  genes  in  the  cluster  (KAR4,  AGAl,  SST2,  and 
FUSl)  are  induced  by  a  factor  and  so  are  very  strongly 
expressed  at  the  beginning  of  the  a  factor  experiment. 
However,  these  four  genes  oscillate  in  the  other  exper¬ 
iments  when  no  a  factor  is  present.  We  found  MCMl 
binding  sites  in  the  upstream  regions  of  several  of 
these  genes,  including  MFal  and  MFa2.  Furthermore, 
as  discussed  below,  we  found  MATal,  the  transcrip¬ 
tion  factor  that  cooperates  with  Mcmlp  to  induce 


a-specific  genes,  is  itself  cell  cycle  regulated,  and  this 
may  largely  explain  the  oscillation  of  the  a  specific 
genes  in  this  cluster. 

Other  Genes  and  Regulators 

The  nine  clusters  or  near  clusters  summarized  in  Table 
4  account  for  about  half  of  the  cell  cycle-regulated 
genes.  The  remaining  genes  tend  to  be  less  strongly 
cell  cycle  regulated  and  cluster  less  tightly.  We  have 
attempted  to  find  novel  elements  in  the  promoters  of 
the  remaining  genes  without  great  success.  The  best  of 
these  elements  was  the  consensus  GCAGNRNCCW, 
which  we  found  in  the  upstream  regions  of  CLB4, 
BUD3,  CPR8,  PR02,  YCL012W,  YCLOBSW,  YGL217C, 
YNL043C,  YDR130C,  and  YOL030W;  these  genes  ap¬ 
pear  to  be  moderately  well  coregulated  (peak  expres¬ 
sion  occurs  in  G2).  There  may  be  additional,  novel, 
upstream  elements  that  we  are  unable  to  find. 

It  is  likely  that  many  of  the  remaining  genes  are 
actually  coregulated  with  members  of  the  clusters  we 
have  described,  and  their  transcription  may  be  con¬ 
trolled  by  the  same  types  of  elements.  Indeed,  we 
know  that  some  of  the  remaining  genes  have  recog¬ 
nizable  elements  (e.g.,  MCBs  and  SCBs),  whereas  in 
other  cases,  the  elements  may  be  highly  degenerate 
versions  of  the  known  elements.  This  may  explain 
why  the  cell  cycle  regulation  we  observe  is  relatively 
weak  and  why  the  genes  do  not  cluster  tightly.  Finally, 
mRNA  levels  could  oscillate,  not  because  of  transcrip¬ 
tional  control,  but  because  of  cell  cycle  control  of 
mRNA  stability;  the  histone  mRNAs  are  controlled 
partly  in  this  way  (Wang  et  al,  1996). 

For  the  clusters  we  have  identified,  some  of  the 
genes  in  the  cluster  do  not  contain  an  obvious  element; 
for  instance,  nine  of  the  genes  in  the  CLB2  cluster  do 
not  contain  an  obvious  MCMl  +  SFF  site.  We  do  not 
know  whether  these  genes  contain  cryptic,  degenerate 
sites  that  our  algorithms  fail  to  recognize,  or  whether 
these  genes  are  regulated  by  an  rmknown  factor. 

The  Functions  of  the  Cell  Cycle-regulated  Genes 

The  major  functions  of  the  cell  cycle  regulated  genes  we 
identified  are  cell  cycle  control,  DNA  replication,  DNA 
repair,  budding,  glycosylation,  nuclear  division  and  mi- 
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tosis,  structure  of  the  cytoskeleton,  and  mating.  In  Figure 
7  we  arrange  294  named  genes  in  our  set,  according  to 
both  a  functional  class  and  the  phase  group  to  which 
they  belong. 

DNA  Replication,,  Repair,  and  Chromosome 
Assembly 

It  is  instructive  to  look  at  the  pattern  of  expression  of 
genes  involved  in  a  particular  process.  For  instance,  we 
can  trace  the  expression  of  many  genes  somehow  in¬ 
volved  in  DNA  replication  (as  shown  in  Figure  7).  Of  the 


genes  that  peak  in  G1  there  are  23  genes  with  known 
functions  in  DNA  replication.  These  genes  include  sub¬ 
units  of  the  DNA  polymerases  and  their  accessory  fac¬ 
tors  (e.g.  CDC2,  POLl,  and  POL2),  genes  involved  in 
nucleotide  synthesis  (e.g.  CDC21),  and  genes  involved  in 
initiation  of  DNA  synthesis  (e.g.  CDC45).  Many  genes 
involved  in  DNA  repair  such  as  PMSl  and  MSH2  reach 
peak  expression  in  G1  phase,  suggesting  that  repair  of 
DNA  lesions  may  be  a  normal  part  of  S  phase. 

Later,  when  S  phase  is  actually  occurring,  the  his¬ 
tone  genes  reach  peak  expression.  In  late  M  phase  or 
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Figure  7 
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M/Gl  all  six  MCM  genes  important  for  prereplicative 
complex  formation  (MCM2,  MCM3,  CDC54,  CDC47, 
MCM6,  CDC47,  and  CDC54)  and  CDC6  reach  their 
peaks,  presumably  to  help  set  up  origins  for  the  next 
cell  cycle.  Thus,  many  genes  needed  for  replication 
and  repair  reach  peak  expression  just  before  they  are 
needed,  the  histones  peak  exactly  at  the  time  they  are 
needed,  and  a  few  genes  important  for  regulation  of 
DNA  synthesis  peak  well  in  advance  of  the  next  round 
of  S  phase.  Only  two  known  initiator  genes,  CDC45 
and  DBF4  (which  we  did  not  identify  in  our  analysis; 
see  below)  peak  just  before  S  phase,  suggesting  these 
may  be  particularly  important  to  trigger  replication. 

Bud  Initiation  and  Bud  Growth 

Budding  is  a  major  metabolic  activity  for  the  cell  and 
involves  several  subprocesses.  The  cell  must  choose  a 
site  for  the  new  bud  (initiation)  and  make  components 
for  an  ever-increasing  surface  area  consisting  of  a  new 
cell  membrane  (which  requires  lipids  and  integral 
membrane  proteins)  and  a  new  cell  wall  (composed 
largely  of  glucan,  chitin,  and  mannoproteins).  All  of 
these  processes  require  delivery  of  components,  via 
the  secretory  apparatus,  to  the  sites  of  new  membrane 
and  cell  wall  synthesis,  which,  in  normal  conditions, 
occurs  exclusively  in  the  bud  (Kaiser  et  ah,  1997;  for 
reviews,  see  Lew  et  al,  1997;  Orlean,  1997). 

We  found  17  genes  that  involved  in  bud  site  selec¬ 
tion  and  cell  polarization  (e.g.,  BIID3,  BUD4,  BUDS, 
BUD9,  BEMl,  GICl,  MSBl,  and  MSB2).  As  indicated 
in  Figure  7,  none  of  these  genes  had  been  reported  to 
be  cell  cycle  regulated.  Some  of  these  (BUD9,  CDCIO, 
and  RSRl)  show  peak  expression  in  Gl,  consistent 
with  roles  in  bud  initiation.  Others,  (BUD4,  BUDS,  and 
BEMl)  peak  in  M  phase,  suggesting  roles  in  the  fol¬ 
lowing  cell  cycle,  i.e.,  earlier  in  the  budding  pathway 
than  the  Gl  group.  We  also  identified  many  genes 
needed  for  secretion,  glycosylation  (needed  for  mak¬ 
ing  mannoproteins),  synthesis  of  lipids,  and  cell  wall 
synthesis. 

Cell  Division  and  Mitosis 

Another  fundamental  process  of  cell  division,  in 
which  a  large  number  of  the  genes  involved  have  their 
messages  regulated  by  the  cell  cycle,  is  the  process  of 
mitosis  (for  review  of  microtubule-related  topics,  see 
Botstein  et  ah,  1997).  During  the  cell  cycle  many  events 
occur  that  allow  mitosis  to  progress  in  a  timely  man¬ 
ner.  This  process  begins  in  Gl  when  the  spindle  pole 


body  (SPB)  replicates.  To  facilitate  this  process  six 
known  components  of  the  SPB  reach  peak  expression 
in  Gl  {CNM67,  NUFl,  SPC42,  SPC97,  SPC9S,  and 
TUB4),  one  (SPC34)  peaks  during  S,  and  one  iNUF2) 
peaks  during  M  phase.  Some  of  these  genes  were 
already  known  to  be  cell  cycle  regulated  (NUFl,  and 
SPC42)  (Kilmartin  et  al,  1993;  Donaldson  and  Kilmar- 
tin  1996). 

Once  the  mitotic  program  is  entered  the  cell  must 
create  a  spindle,  which  is  responsible  for  moving  the 
nucleus  to  the  bud  neck  so  that  nuclear  division  can 
occur.  This  process  requires  microtubules  and  many 
accessory  proteins  (to  form  the  spindle)  as  well  as 
kinesins  (for  movements  of  the  nucleus  and  the  SPB). 
These  genes  reach  peak  expression  largely  during  the 
first  half  of  the  cell  cycle.  In  Gl  BIMl,  BUBl,  IPLl, 
KAR3,  and  SLK19  reach  peak  expression,  and  during 
S,  five  genes  (CINS,  KAR9,  KIPl,  STU2,  and  VIKl) 
peak.  Five  genes  peak  during  G2  (BUB2,  CIKl,  KIP2, 
K1P3,  and  NUMl),  as  well  as  the  major  j8  tubulin 
TUB2.  Finally,  one  gene  (ASEl)  reaches  peak  expres¬ 
sion  during  M. 

It  was  somewhat  unexpected  that  tubulin  messages 
would  be  regulated  by  the  cell  cycle;  unfortunately  the 
microarrays  that  we  used  for  the  a  factor  and  elutria- 
tion  experiments  did  not  contain  DNA  complemen¬ 
tary  to  either  the  major  (TUBl)  or  minor  (TUB3)  a 
tubulins.  Our  data  set  suggested  that  that  TUBl  might 
be  cell  cycle  regulated  because  it  had  a  score  just 
below  our  cutojQf.  We  wished  to  verify  that  the  major 
tubulins  were  regulated  in  the  cell  cycle  by  an  inde¬ 
pendent  method  (quantitative  real-time  PGR  [Heid  et 
al.,  1996]).  This  method  allows  determinations  of  rel¬ 
ative  mRNA  levels  with  excellent  reproducibility.  We 
performed  the  analysis  as  detailed  in  MATERIALS 
AND  METHODS  with  the  result  that,  as  we  sus¬ 
pected,  TUBl  and  TUB2  are  moderately  cell  cycle 
regulated,  but  TUB3  appears  less  so  (Figure  8).  This 
suggests  that  the  low  score  for  TUBl  may  have  been 
caused  by  their  absence  from  some  of  the  arrays.  It 
should  be  noted  that  TUB2  with  a  score  of  2.33  is 
clearly  above  the  threshold  we  set  for  cell  cycle  regu¬ 
lation,  but  that  TUBl  with  a  score  of  1.25  is  just  below, 
and  TUB3  (score  0.53)  is  considerably  below  the 
threshold.  Comparison  with  Figure  8  illustrates  the 
point  that  a  score  near  the  threshold  can  be  the  result 
either  of  inadequate  data  or  weak  regulation. 

One  relatively  small  class  of  genes  that  displays 
tight  temporal  regulation  is  a  group  of  genes  involved 
in  chromatid  cohesion.  Five  of  these  genes  (SMCl, 


Figure  7  (facing  page).  Cell  cycle-regulated  genes  with  characterized  functions.  Two  hundred  ninety-seven  of  the  cell  cycle-regulated  genes 
are  grouped  by  both  function  and  phase  of  peak  expression.  Several  functional  groups  are  split  into  subgroups,  which  reflect  the  nature  of 
the  function.  Those  highlighted  in  red  were  previously  known  to  be  cell  cycle  regulated.  Many  functional  categories  display  strong  biases 
toward  gene  expression  during  particular  intervals  of  the  cell  cycle.  Obvious  examples  include  genes  involved  in  DNA  synthesis  and  DNA 
repair  (Gl),  mating  (M,  M/Gl,  and  Gl),  chromatin  structure  (Gl  and  S),  and  methionine  biosynthesis  (S  and  G2). 
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Figure  8.  Tubulin  message  levels.  The  mRNA  levels  for  TUBl, 
TUB2,  and  TUB3,  relative  to  those  of  PPAl,  were  determined  during 
synchronous  division  after  release  from  an  a  factor  arrest,  using  the 
TAQman  assay  as  described  in  MATERIALS  AND  METHODS. 


SMC3,  MCDl,  PDSl,  and  PDS5  [Strunnikov  et  al, 
1993;  Yamamoto  et  al,  1996;  Guacci  et  al,  1997; 
Michaelis  et  al,  1997])  have  peak  expression  during  G1 
just  before  the  next  round  of  DNA  synthesis. 

At  the  end  of  the  cell  cycle  the  cell  must  exit  mitosis 
so  that  the  next  round  of  division  can  occur.  To  do 
this,  a  system  of  proteins  acts  to  inhibit  the  activity  of 
Clb-Cdc28p.  One  of  these  proteins  is  Siclp,  whose 
expression  is  known  to  peak  at  this  time  (Donovan  et 
al,  1994).  Many  of  the  proteins  that  inhibit  Clb-Cdc28p 
or  prepare  the  cell  to  exit  from  mitosis  are  known  to  be 
cell  cycle  regulated  and  peak  in  M  phase.  We  also  find 
that  DBF20  (which  is  functionally  related  to  DBF2)  is 
cell  cycle  regulated  and  peaks  in  G2. 

Mating 

At  least  19  genes  directly  involved  in  mating  are  cell 
cycle  regulated.  These  include  both  mating  phero¬ 
mones  (a-factor  and  a-factor)  and,  perhaps  most  inter¬ 
estingly,  include  the  central  mating-type  transcription 
factor  MATal  itself.  MATal  binds  to  DNA  in  cooper¬ 
ation  with  Mcml  (Sengupta  and  Cochran,  1991)  and 
induces  expression  of  a-specific  genes.  It  was  previ¬ 
ously  shown  that  some  genes  involved  in  mating  were 
cell  cycle  regulated,  and  this  regulation  was  shown  to 
be  due  to  cooperative  binding  between  Mcml  and 
Stel2.  The  fact  that  the  MATal  transcription  factor 
itself  oscillates  provides  yet  another  mechanism  by 
which  genes  involved  in  mating  might  be  cell  cycle 
regulated.  We  found  Mcml  sites  in  the  upstream  re¬ 
gions  of  several  of  these  genes,  including  MATal.  The 
regulation  of  genes  involved  in  mating  is  clearly  com¬ 
plex,  and  several  transcription  factors  are  involved. 
However,  it  seems  that  most  of  these  transcription 
factors  cooperate  in  one  way  or  another  with  Mcml. 


The  fact  that  so  many  mating  functions  are  cell  cycle 
regulated,  including  an  a-specific  transcription  factor, 
helps  explain  the  deep  connection  between  mating, 
start,  and  the  cell  cycle.  For  instance,  if  genes  involved 
in  mating  are  turned  off  at  start  by  multiple  mecha¬ 
nisms,  it  helps  explain  how  passage  through  start 
precludes  mating. 

Cell  Cycle  Control  Genes 

Of  the  19  genes  involved  in  cell  cycle  control  we 
identified,  17  were  already  known  to  be  cell  cycle 
regulated.  This  set  mainly  includes  cyclins  and  tran¬ 
scription  factors,  whose  activities  and  time  of  action 
are  well  documented  (see  Koch  and  Nasmyth,  1994; 
Andrews  and  Measday,  1998).  The  only  two  cell  cycle 
control  genes  that  we  identified  newly  as  regulated 
were  WHI3  and  HSL7. 

Methionine  Biosynthesis 

It  was  an  unexpected  and  somewhat  surprising  result 
that  many  genes  involved  in  methionine  biosynthesis 
are  cell  cycle  regulated.  A  number  of  possibilities  sug¬ 
gest  themselves.  First,  the  pool  of  available  cellular 
methionine  is  smaller  than  virtually  any  other  amino 
acid;  thus,  methionine  is  likely  to  be  limiting  (Jones 
and  Fink,  1982).  Indeed,  Unger  and  Hartwell  (1976) 
noted  that  starvation  for  sulfur  or  for  methionine  ef¬ 
fectively  causes  G1  arrest,  suggesting  that  cell  cycle 
progression  is  particularly  sensitive  to  the  availability 
of  methionine.  They  also  found  that  a  temperature- 
sensitive  allele  of  methionine  tRNA  synthetase  causes 
G1  arrest,  even  in  the  presence  of  methionine.  These 
observations  suggest  that  the  cell  cycle  regulation  of 
methionine  genes  ensures  sufficient  capacity  for  pro¬ 
tein  synthesis  in  that  biosynthetic  pathway  for  the  next 
cell  cycle;  if  there  are  insufficient  resources,  G1  arrest 
ensues. 

It  is  known  that  the  more  than  20  genes  that  consti¬ 
tute  the  sulfur  amino  acid  biosythesis  pathway  are 
coordinately  regulated  at  the  level  of  transcription. 
This  transcription  is  repressed  in  response  to  an  in¬ 
crease  in  the  intracellular  concentration  of  S-adenosyl- 
methionine,  an  end  product  of  the  pathway  (methio- 
nyl  tRNA  is  another  end  product)  (Thomas  et  al, 
1989).  A  second  possibility  therefore  is  that  the  con¬ 
centration  of  S-adenosylmethionine  is  depleted  as 
cells  enter  S  phase,  causing  derepression  of  these 
genes,  which  results  in  cell  cycle  regulation. 

A  third  possibility  is  that  the  protein  that  actually 
represses  these  genes,  Met30p,  is  available  in  limiting 
amounts  and  for  some  reason  is  titrated  during  or  just 
before  S  phase,  causing  coordinate  derepression  of 
this  set  of  genes.  Data  supporting  this  idea  are  that 
Met30p  is  involved  in  cell  cycle  regulation  as  an  F-box 
protein  that  targets  Swel  for  degradation  (Kaiser  et  al, 
1998;  Patton  et  al,  1998).  SWEl  transcription  is  cell 
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cycle  regulated  (Ma  et  al.,  1996)  (our  analyses  recapit¬ 
ulate  this  observation),  peaking  at  the  Gl/S  phase 
boundary.  Therefore,  the  concentration  of  a  known 
MetSOp  substrate  increases  just  before  the  derepres¬ 
sion  of  genes  involved  in  methionine  biosynthesis  that 
MetSOp  is  known  to  repress.  Thus,  MetSOp  may  be¬ 
come  limiting,  allowing  expression  of  the  MET  genes. 
We  do  not  know  whether  the  cell  cycle  regulation  of 
these  genes  is  important  for  their  function. 

Interestingly,  another  F-box  protein,  Grrl,  which  is 
also  involved  in  cell  cycle  regulation,  regulates  the 
expression  of  the  HXT  genes  (hexose  transporters). 
The  HXT  genes  are  members  of  a  cluster  of  very 
weakly  cell  cycle-regulated  genes  peaking  in  M/Gl 
that  also  includes  PHDl  and  RGAl  (visible  in  Figure 
IB  on  our  web  site).  Thus,  two  different  F-box  proteins 
involved  in  cell  cycle  control  also  regulate  genes  in¬ 
volved  in  providing  nutrients,  and  these  nutrient-re¬ 
lated  genes  are  weakly  cell  cycle  regulated.  It  is  pos¬ 
sible  that  these  F-box  proteins  somehow  coordinate 
nutrient  availability  with  the  cell  cycle. 

Other  Nutritional  Genes 

A  very  large  fraction  of  the  genes  involved  in  nutrition 
that  are  cell  cycle  regulated  are  involved  in  transport 
of  essential  minerals  and  organic  compounds  across 
the  cell  membrane.  Some  of  the  compounds  that  are 
moved  by  these  transporters  are  amino  acids  (GAPl), 
ammonia  {AUAl  and  MEP3),  sugars  (e.g.,  HXTl  and 
RGT2),  and  iron  {FET3  and  FTRl).  We  also  identified 
the  acid  phosphatases  (e.g.,  PH03  and  PH08).  Nearly 
all  of  these  genes  reach  peak  expression  late  in  the  cell 
cycle  during  M  and  M/Gl. 

Developmental  Pathway  Genes:  Sporulation  and 
Pseudohyphal  Growth 

A  number  of  genes  associated  with  functions  in  spe¬ 
cialized  developmental  pathways  show  cell  cycle  reg¬ 
ulation.  These  include  the  apparently  sporulation-spe- 
cific  genes  SPS4  and  SSP2,  which  have  peak 
expression  in  M  and  M/Gl,  respectively.  These  might 
represent  cases  such  as  SP012,  which  has  known  func¬ 
tion  in  both  mitotic  and  meiotic  pathways  (Klapholz 
and  Esposito,  1980;  Toyn  and  Johnston,  1993). 

The  Y'  Genes 

Although  not  strictly  a  functional  category,  the  Y' 
genes  form  an  interesting  group  of  coregulated  genes. 
The  Y'  sequences  are  repeated  sequences  found  just 
centromere  proximal  to  the  telomere  itself  on  many 
chromosomes.  Within  the  Y'  elements  are  two  open 
reading  frames,  and  there  are  appropriate  splicing 
signals  that  suggest  that  they  form  one  large  product, 
although  it  has  not  been  shown  experimentally  that 
these  sites  are  functional  (Louis  and  Haber,  1992; 


Louis,  1995),  The  larger  (telomere  proximal)  of  these 
two  ORFs  shows  similarity  to  RNA  helicases,  contain¬ 
ing  all  the  motifs  known  to  be  necessary  for  helicase 
activity  (Louis  and  Haber,  1992).  However,  sequence 
similarities  among  these  ORFs  are  very  high,  and  we 
cannot  distinguish  whether  one,  a  few,  or  all  of  these 
elements  are  cell  cycle  regulated. 


The  GAL~CLN3  and  GAL-CLB2  Experiments 

Our  experiments  to  investigate  the  transcriptional  ef¬ 
fects  of  Cln3p  and  Clb2p  provide  an  excellent  corrob¬ 
orative  data  set  that  supports  cell  cycle  regulation  for 
more  half  of  the  genes  in  our  list.  Of  the  genes  that  are 
cell  cycle  regulated,  there  are  116  genes  that  are  in¬ 
duced  more  than  twofold  by  Cln3p,  and  are  repressed 
by  Clb2p.  Eighty-seven  percent  of  these  peak  in  either 
G1  or  S  phase.  In  contrast  there  are  only  eight  cell 
cycle-regulated  genes  that  are  induced  by  Cln3p  and 
not  repressed  by  Clb2p.  There  are  33  genes  induced 
greater  than  twofold  by  Clb2p  that  are  repressed  by 
Cln3p,  whereas  only  five  genes  induced  by  Clb2  are 
not  repressed  by  Cln3p.  All  cell  cycle-regulated  genes 
responsive  to  Clb2p  peak  in  either  M  or  M/Gl  phases. 

There  were  also  genes  that  responded  to  Clb2p  or 
Cln3p  (or  both)  that  we  did  not  identify  as  cell  cycle 
regulated.  For  instance  there  are  53  genes  induced  by 
Cln3p  and  repressed  by  Clb2p  that  are  not  on  our  cell 
cycle-regulated  list.  Many  of  these  are  involved  in 
functions  for  which  we  know  many  genes  are  cell 
cycle  regulated,  e.g.,  secretion  (PMT2,  PMT4,  SEC53, 
and  SEC21),  chitin  synthesis  (CHS3),  and  nucleotide 
biosynthesis  (ADE3,  RNR2).  However,  we  have  no 
other  evidence  to  suggest  that  these  may  be  false 
negatives.  Indeed,  by  visual  inspection,  none  of  these 
genes  displays  convincing  signs  of  periodicity.  This 
observation  reinforces  the  notion  that  using  many 
types  of  experiments  is  crucial  to  drawing  legitimate 
conclusions. 

Our  experiments  on  the  transcriptional  effects  of 
Cln3p  and  Clb2p  help  us  dissect  the  transcriptional 
regulators  of  each  gene  (see  above).  In  addition,  they 
support  the  notion  that  mechanistically  two  opposing 
oscillators  drive  the  cell  cycle.  This  is  particularly  well 
illustrated  in  some  of  the  subclusters,  for  instance,  the 
CLN2  cluster,  where  the  effects  of  CLN3  induction  are 
almost  exactly  mirrored  by  the  opposite  effects  of 
CLB2  induction.  For  other  subclusters  we  see  that  the 
genes  respond  to  only  one  of  the  cyclins,  (e.g.,  the  Y' 
cluster  is  induced  by  CLN3  yet  relatively  unchanged 
by  CLB2). 

Finally,  we  found  that  CLN3  can  repress  the  tran¬ 
scription  of  certain  genes,  particularly  a  group  of 
genes  involved  in  mating.  This  was  not  entirely  unex¬ 
pected,  because  it  had  previously  been  demonstrated 
that  FARl  transcription  (McKinney  et  al,  1993)  is  neg- 
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atively  regulated  by  start,  although  a  direct  link  to 
ClnSp  activity  has  not  been  previously  demonstrated. 

DISCUSSION 

Reliability  of  the  Methods  for  Assessing  Cell  Cycle 
Regulation 

Our  identification  of  a  gene  as  cell  cycle  regulated  was 
objective  in  the  sense  that  it  was  entirely  quantitative. 
Nevertheless,  it  should  be  clear  that  the  setting  of  the 
threshold  was  in  the  end  arbitrary.  We  can  ask 
whether  genes  below  the  threshold  might  still  be  cell 
cycle  regulated  in  a  biologically  significant  way,  and 
we  can  consider  whether  there  are  any  circumstances 
in  which  our  experiments  might  have  failed  to  reveal 
the  cell  cycle  pattern  of  regulation.  In  this  context  it  is 
relevant  that  we  do  not  identify  9  of  the  104  genes 
{CDC8,  DBF4,  CHS3,  PRll,  TIRl,  CDC14,  CLB3,  DPB3, 
and  RAD17)  previously  reported  to  be  cell  cycle  reg¬ 
ulated  (White  et  al,  1987;  Chapman  and  Johnston, 
1989;  Siede  et  al,  1989;  Johnston  et  al,  1990b;  Araki  et 
al,  1991;  Fitch  et  al,  1992;  Wan  et  al,  1992;  Igual  et  al, 
1996;  Caro  et  al.,  1998).  Of  these,  only  RADII  appears 
visually  to  vary  with  the  cell  cycle  in  our  data  and  then 
only  in  the  cdcl5  experiment.  We  believe  that  most  of 
the  remaining  false  negatives  are  due  to  noise  in  our 
data  set  that  dampens  their  signal.  It  is  worth  noting 
that  some  of  these  genes  showed  only  very  weak  cell 
cycle  regulation  in  the  original  publications  or  in  other 
studies  done  by  traditional  methods.  For  instance, 
DBF4  oscillated  by  2.5-fold  in  the  original  publication 
(Chapman  and  Johnston,  1989),  and  in  some  Northern 
blotting  experiments,  oscillation  is  not  seen  (Sclafani, 
personal  communication). 

This  analysis  convinces  us  that  relatively  few  genes 
with  significant  cell  cycle  regulation  have  not  been 
identified.  However,  there  are  many  plausible  causes 
of  false-positive  identifications.  Random  fluctuations 
in  the  data  could  appear  as  a  cell  cycle  oscillation,  but 
as  described  in  MATERIALS  AND  METHODS,  we 
expect  this  to  be  a  relatively  rare  event. 

Cross-hybridization  between  genes  whose  DNA  se¬ 
quences  are  similar  can  produce  false  positives  when 
only  one  of  the  genes  is  actually  cell  cycle  regulated.  It 
has  been  estimated  that  cross  hybridization  in  our 
system  can  become  significant  at  or  above  75%  DNA 
sequence  identity  (DeRisi,  Iyer,  and  Brown,  personal 
communication).  For  instance,  our  data  set  includes 
both  DBPl  and  DBP20,  which  are  75%  identical  over 
the  last  800  bp  of  each  gene;  it  has  previously  been 
published  that  only  DBPl  is  cell  cycle  regulated 
(Johnston  et  al,  1990a;  Toyn  et  al,  1991).  DBF20  may 
appear  in  our  set  because  DBF2  cDNA  is  cross-hybrid¬ 
izing  with  the  DNA  sequence  of  DBF20  on  the  mi¬ 
croarray.  These  two  genes  show  somewhat  different 
regulation,  however,  so  this  could  be  an  instance  of 
strain  differences  or  errors  in  the  published  literature. 
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Figure  9.  Aggregate  CDC  score  is  largely  independent  of  the  cdc28 
data  set.  Scatter  plot  for  the  scores  that  were  derived  using  either  the 
cdcl5  and  a  factor  experiments  only  (plotted  on  the  y  axis)  or  using 
the  cdcl5,  the  a  factor,  and  the  cdc28  experiments  (x  axis).  A  single 
dot  is  plotted  for  each  gene.  It  should  be  noted  that  adding  data 
increases  the  absolute  magnitude  of  the  scores;  it  is  the  relative 
magnitude,  however,  that  indicates  cell  cycle  regulation. 


A  third  manner  in  which  false  positives  can  occur  is 
when  an  unregulated  gene  overlaps  the  mRNA  for  a 
cell  cycle-regulated  gene.  The  cDNA  corresponding  to 
the  regulated  gene  would  hybridize  with  the  unregu¬ 
lated  gene's  DNA,  generating  a  false  positive.  In  our 
list  there  are  42  pairs  of  genes  that  overlap,  and  39 
additional  pairs  in  which  the  distance  between  the  two 
genes  is  <300  bp.  A  full  list  of  all  genes  that  are  near 
each  other  by  chromosomal  position  can  be  found  at 
the  web  site. 

Comparisons  with  Previous  Analyses 
One  part  of  our  method  for  identifying  genes  regu¬ 
lated  by  the  cell  cycle  relied  on  the  examining  the  large 
body  of  data  already  that  has  been  published.  Cho  et 
al  (1998)  performed  a  study  similar  to  ours  using 
different  technology  and  methods.  Our  methods  can 
aggregate  data  from  many  experiments  and  thereby 
improve  the  signal-to-noise  ratio  in  the  total  data  set. 
It  illustrates  as  well  the  value  and  necessity  for  making 
available  primary  data  from  genome  scale  experi¬ 
ments  of  this  kind.  The  data  from  Cho  et  al  (1998) 
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aided  in  our  ability  to  identify  some  genes  as  being  cell 
cycle  regulated.  Figure  9  shows  several  points  located 
very  near  the  x  axis  that  represent  genes  whose  iden¬ 
tification  as  cell  cycle  regulated  clearly  depends  on  the 
data  of  Cho  et  al  The  assessment  of  cell  cycle  regula¬ 
tion  for  the  remainder  of  the  genes,  as  shown  in  Figure 
9,  is  essentially  the  same  whether  the  data  of  Cho  et  al 
are  included. 

The  most  obvious  difference  between  our  results 
and  those  of  Cho  et  al.  (1998)  is  in  the  number  of  genes 
identified  as  cell  cycle  regulated.  With  a  manual  deci¬ 
sion  process,  they  found  421  genes  to  be  cell  cycle 
regulated.  Our  set  of  800  genes  includes  304  of  these, 
but  the  other  117  do  not  appear  significantly  cell  cycle 
regulated  in  our  data.  Our  set  of  800  therefore  contains 
496  genes  not  identified  by  Cho  et  al. 

There  were  many  technical  differences  in  the  way 
the  two  studies  were  carried  out,  and  it  is  difficult  to 
say  how  these  may  have  contributed  to  the  differences 
between  the  results.  One  significant  advantage  of  our 
analysis  was  the  diversity  of  experiments  from  which 
we  could  identify  the  characteristic  pattern  of  cell 
cycle  regulation.  This  allowed  us  to  distinguish  cell 
cycle  regulation  from  confounding  patterns  such  as 
those  caused  by  the  heat  shock  response  when  a  cul¬ 
ture  is  shifted  from  one  temperature  to  another. 

One  of  the  largest  discrepancies  between  the  two 
analyses  regards  genes  that  may  peak  twice  per  cell 
cycle.  We  identified  10  genes  as  cell  cycle  regulated 
that  according  to  Cho  et  al  (1998)  showed  more  than 
one  peak  but  had  no  single  prominent  peak  in  expres¬ 
sion.  Our  algorithm  is  designed  to  find  genes  with 
single  expression  peaks;  it  significantly  penalizes  more 
than  one  peak.  Furthermore,  visual  inspection  of  the 
aggregate  data  does  not  show  multiple  peaks  in  any  of 
these  cases.  Therefore,  we  believe  the  aggregate  data 
set  does  not  support  the  existence  of  multiple  peaks  of 
expression  for  these  genes.  This  leaves  open  the  pos¬ 
sibility  that  there  may  be  other  genes  with  more  than 
one  peak  per  cell  cycle. 

Many  of  the  genes  differing  between  the  two  data 
sets  had  low  peak-to-trough  ratios  and  relatively  poor 
aggregate  CDC  scores.  It  is  perhaps  natural  that  we 
identified  a  larger  number  of  modestly  regulated 
genes,  because  we  had  a  larger  number  of  experiments 
and  a  statistical  rather  than  manual  approach  to  iden¬ 
tification.  It  is  important  to  note,  however,  that  some 
of  the  differences  were  in  genes  with  very  strong  cell 
cycle  regulation.  Cho  et  al  (1998)  failed  to  find  FKSl, 
GOGS,  EGT2,  two  histone  genes,  and  several  other 
genes  with  very  strong  regulation.  Before  including 
the  data  set  of  Cho  et  al,  we  had  failed  to  find  HO. 
Quite  a  few  of  the  strongly  regulated  genes  not  iden¬ 
tified  by  Cho  et  al  were  genes  known  to  be  regulated 
fairly  directly  by  Cdc28p  (in  combination  with  either 
ClnSp  or  Clb2p).  The  expression  of  such  CDC28-de- 


pendent  genes  may  have  been  altered  in  the  cdc28-13 
block-release  experiment  of  Cho  et  al 

Mechanisms  of  Transcriptional  Regulation 

The  800  genes  in  our  list  were  examined  for  the  bind¬ 
ing  sites  of  known  cell  cycle  transcription  factors,  and 
for  a  little  more  than  half  of  these  we  found  good 
matches  to  known  sites  relevant  to  the  phase  of  peak 
expression.  Moreover,  nearly  70%  of  these  same  genes 
showed  a  significant  response  to  ClnSp  or  Clb2p  in¬ 
duction  (or  both).  In  addition,  we  identified  as  cell 
cycle  regulated  other  sets  of  genes  that  form  functional 
pathways  and  are  known  to  be  coregulated  (e.g.,  the 
methionine  biosynthesis  genes)  or  about  whose  regu¬ 
lation  something  is  known  (e.g.,  the  hexose  transport¬ 
ers).  Thus,  there  are  —500  genes  for  which  we  under¬ 
stand,  at  some  level,  the  molecular  mechanism  of  cell 
cycle  control,  a  gratifying  number. 

This,  however,  leaves  —300  genes  for  which  we  do  not 
have  good  binding  sites  for  appropriate  cell  cycle  tran¬ 
scription  factors  and  have  not  identified  any  compelling 
novel  sites.  Because  these  300  genes  tend  to  be  the  less 
strongly  regulated  ones,  it  may  be  that  some  of  them 
contain  degenerate  sites  for  known  factors  and  were 
therefore  missed  in  our  searches  (which  generally  did 
not  allow  mismatches).  Alternatively,  some  of  these 
genes  may  be  controlled  by  novel  sites  that  we  failed  to 
identify;  others  may  be  controlled  by  known  factors  in  a 
combinatorial  manner  such  that  peak  expression  is  at  an 
unexpected  time.  Another  possibility  is  that  some  of 
these  transcripts  may  be  controlled  in  some  other  way, 
such  as  by  mRNA  stability.  Our  data  set  provides  fertile 
ground  for  further  investigation  of  these  questions. 

These  data  give  us  a  partial  picture  of  the  logical 
circuitry  of  transcriptional  controls  in  the  cell  cycle.  A 
large  number  of  genes  (^200)  are  induced  in  G1  and  S 
by  the  action  of  Cln3p-Cdc28p  on  MBF  and  SBF.  Fur¬ 
thermore,  by  M  they  become  repressed  by  the  action 
of  Clb2p-Cdc28p.  At  the  same  time,  Clb2p-Cdc28p, 
acting  through  MCMl  +  SFF,  induces  its  own  constel¬ 
lation  of  genes,  which  may  number  >50.  These  genes 
include  the  important  transcription  factor  Swi5p;  once 
its  transcription  has  been  induced,  and  it  is  allowed  to 
enter  the  nucleus  (perhaps  because  of  a  dip  in  Clb2p- 
Cdc28p  activity)  (Moll  et  al,  1991),  another  set  of  genes 
is  turned  on.  Among  these  is  the  CDK  inhibitor  SICl, 
which  is  important  for  deactivating  Clb2p-Cdc28p. 
The  loss  of  Clb2-Cdc28  activity  causes  a  collapse  in  the 
transcription  of  all  Clb2p-dependent  transcripts  and, 
moreover,  allows  Cln3p-Cdc28p  to  reactivate  MBF 
and  SBF  to  begin  a  new  cell  cycle.  A  gap  in  this  picture 
is  that  we  do  not  understand  the  oscillation  in  the 
genes  expressed  in  M/Gl  phase  from  an  MCMl  site 
(the  ECB).  That  is,  what  is  it  about  an  ECB  or  about 
Mcmlp  that  makes  expression  of  these  genes  cell  cycle 
regulated?  A  second  omission  is  that  we  do  not  un- 
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derstand  very  clearly  how  CLB2  is  induced  in  the  first 
place,  although  part  of  the  answer  certainly  lies  in 
regulation  of  Clb2p  protein  stability  and  Clb2p- 
Cdc28p  activity. 

Functional  Significance  of  Cell  Cycle  Regulation 

Studies  of  cell  cycle  regulation  have  focused  on  genes 
with  cell  cycle-specific  functions.  That  is,  they  are 
genes  whose  functions  are  only  needed  for  a  part  of 
the  cycle.  These  genes  are  directly  involved  in,  for 
instance,  DNA  replication,  budding,  and  mitosis.  For 
some  such  genes,  transcriptional  reflation  may  be  a 
matter  of  conserving  resources.  For  instance,  it  would 
probably  do  no  great  harm  to  express  CDC9  (DNA 
ligase)  constitutively.  However,  by  expressing  it,  and 
hundreds  of  other  similar  genes,  only  when  needed, 
the  cell  can  achieve  a  small  advantage.  Moreover, 
yeast  often  encounter  what  one  might  call  a  "Sleeping 
Beauty"  situation:  they  lie  dormant  for  weeks  or 
months,  and  then  are  required  to  resume  life  where 
they  left  off.  Genes  needed  for  cycling  are  evidently 
not  needed  during  the  dormant  period,  but  very  much 
needed  immediately  afterward.  Thus,  cell  cycle-regu¬ 
lated  expression  may  ensure  that  necessary  gene  prod¬ 
ucts  are  always  available  to  cycling  cells.  In  this  re¬ 
gard,  it  is  interesting  to  note  that  the  purine-rich  motif 
AAGAAAAA  (Parviz  et  al,  1998)  is  thought  to  be 
important  for  response  to  glucose;  this  motif  may  be 
important  in  the  switch  from  stationary  phase  to  rapid 
growth,  and  we  find  similar  motifs  enriched  in  the 
promoters  of  several  types  of  cell  cycle-regulated 
genes.  That  is,  these  genes  may  be  growth  regulated  as 
well  as  cell  cycle  regulated. 

Other  genes  with  cell  cycle-specific  functions  act  as 
regulators  or  switches.  For  these  genes,  it  is  not  only 
important  when  exactly  they  are  on  but  also  when 
they  are  off.  An  excellent  example  is  Clb2p,  which  is 
important  for  mitotic  events  but  antagonistic  toward 
G1  events.  Clb2p  is  highly  regulated:  its  transcription 
is  regulated;  the  stability  of  the  protein  is  regulated; 
and  the  activity  of  the  Clb2p-Cdc28p  complex  is  reg¬ 
ulated  by  phosphorylation,  dephosphorylation,  and 
the  binding  of  inhibitors.  Any  one  of  these  regulatory 
mechanisms  is  dispensable  for  viability,  but  if  several 
are  lost,  the  deregulated  Clb2p  is  lethal.  Thus,  tran¬ 
scriptional  regulation  of  a  gene  controlling  a  switch 
can  be  central  to  its  function. 

Cell  cycle-regulated  transcription  can  be  used  to 
build  a  structure  in  a  highly  controlled  way.  This  can 
be  illustrated  with  some  parallels  between  the  strategy 
of  the  cell  for  regulating  DNA  replication  and  its 
strategy  for  regulating  budding.  Both  processes  occur 
once  and  only  once  per  cycle;  that  is,  the  cell  must  both 
initiate  the  process  and  also  prevent  reinitiation.  In 
both  cases  the  cell  builds  an  initiation  structure,  which 
for  prereplication  complexes  contains  origin  recogni¬ 


tion  complex  components,  Mem  proteins,  and  Cdc6p, 
and  for  the  prebud  site  contains  Bud  proteins  (and 
others).  In  both  cases  transcriptional  controls  provide 
key  components  of  the  initiation  complexes  at  certain 
times  so  that  the  complexes  can  be  built  in  an  orderly 
manner;  however,  the  complexes  cannot  easily  later  be 
rebuilt  at  an  inappropriate  time,  partly  because  the 
components  are  no  longer  available.  The  building  of 
the  replication  and  budding  initiation  complexes  oc¬ 
curs  long  before  replication  or  budding  actually  occur, 
and  accordingly,  key  components  of  the  complexes 
(e.g.,  Mems,  Cdc6p,  Bud4p,  and  Bud8p)  are  provided 
in  M  phase,  long  before  they  are  used. 

These  types  of  cell  cycle-regulated  genes  are  well 
known.  However,  because  our  identifications  were 
relatively  complete  and  inclusive,  and  because  the 
method  of  identification  was  not  hypothesis  driven, 
we  were  able  to  find  cell  cycle-regulated  genes  of 
quite  unexpected  types.  In  particular,  we  found  many 
cell  cycle-regulated  genes  whose  functions  are  essen¬ 
tially  not  cell  cycle  specific.  These  include  genes  in¬ 
volved  in  secretion  and  lipid  synthesis,  which  are 
probably  needed  at  all  times,  at  least  at  some  level.  In 
these  cases,  the  role  of  cell  cycle  regulation  may  be  to 
provide  extra  transcript  when  there  is  extra  demand, 
i.e.,  at  the  time  of  budding.  The  best  single  example  of 
such  a  gene  may  be  PMAl,  encoding  the  major  plasma 
membrane  proton  pump,  a  stable  protein.  The  PMAl 
function  is  essential,  and  although  its  function  is  re¬ 
quired  throughout  the  cell  cycle  (Serrano  et  al,  1986), 
its  transcription  is  strongly  periodic.  Peak  expression 
probably  coincides  with  the  time  of  fastest  growth  of 
the  plasma  membrane  in  the  bud;  presumably  peri¬ 
odic  expression  is  needed  to  provide  the  PMAl  for  the 
daughter  cell.  In  this  case,  it  is  impractical  to  make  a 
store  of  PMAl  beforehand,  because  excess  PMAl  is 
toxic,  causing  accumulation  of  intracellular  membra¬ 
nous  structures  (Espinet  et  al,  1995).  The  cell  cycle 
regulation  of  PMAl  might  be  considered  partly  an 
answer  to  a  problem  of  stoichiometry.  Stoichiometric 
considerations  are  important  for  other  cell  cycle-regu¬ 
lated  genes  as  well,  notably  the  histones,  SPB  compo¬ 
nents,  and  microtubules. 


Conclusions 

To  summarize,  we  found  800  yeast  genes  whose  tran¬ 
scripts  oscillate  through  one  peak  per  cell  cycle.  We 
defined  these  800  genes  by  using  an  objective,  empir¬ 
ical  model  of  cell  cycle  regulation,  whose  threshold 
was  somewhat  arbitrary.  Below  this  threshold  there 
may  well  be  genes  whose  expression  is  truly  periodic 
and  whose  periodicity  might  even  have  biological  sig¬ 
nificance.  Unfortunately  we  cannot  reliably  detect 
such  genes,  but  it  is  likely  that  they  are  relatively  few 
in  number,  because  very  few  of  the  genes  known 
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beforehand  to  be  periodically  expressed  during  the 
cell  cycle  fall  below  our  threshold. 

With  respect  to  mechanism,  we  could  account  for 
the  periodicity  of  expression  of  about  half.  We  found 
relevant  DNA  motifs  upstream  of  these  genes,  and  we 
observed  independently  that  their  expression  was  af¬ 
fected  by  induction  of  Cln3p  and  Clb2p,  The  basis  of 
the  regulation  of  the  remaining  genes  remains  to  be 
elucidated,  and  some  of  the  detailed  behavior  of  some 
of  the  cyclin-dependent  gene  expression  also  remains 
to  be  explained. 

Finally,  we  hope  that  our  colleagues  in  the  scientific 
community  will  find  this  paper  to  be  valuable  not  as 
only  a  description  of  our  results  but  also  as  a  resource 
for  data  for  some  time  to  come.  We  made  measure¬ 
ments  for  virtually  every  gene  or  open  reading  frame 
in  the  yeast  genome  but  are  in  a  position  to  interpret 
explicitly  only  a  tiny  fraction  of  these  measurements. 
We  make  our  data  available,  as  did  Cho  et  ah  (1998),  in 
the  expectation  that  there  will  be  increasing  value  in 
genomic  data  sets  as  more  of  them  accumulate  and 
that  together  these  will  fully  realize  the  promise  of  the 
genome  sequencing  projects. 
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